conda create -n "autoblogger_wordpress"
conda activate autoblogger_wordpress
pip3 install -r requirements.txt
Copy the .env.example file into a file called .env in the root of the directory. Here are the fields to change:
- WP_MEDIA: Change the base url in the WP_MEDIA line in the .env file to your website.
- Go to add new plugin in wordpress. Search for "application passwords" and install the first app.
- Create a new application in the application passwords app and input WP_APPLICATION_USERNAME and WP_APPLICATION_PASSWORD in the .env file.
- WP_CATEGORIES: replace base url with your site.
- WP_TAGS: replace base url with your site
- UNSPLASH_API_KEY: Make an api key on unsplash.com and replace your api key in the .env file.
- SERPER_API_KEY: Make an api key on serper.dev and replace your api key in the .env file.
- OPENAI_API_KEY: Make an api key on openai.com and replace your api key in the .env file.
- WP_BASE: Change to your base url.
- WP_POSTS: Change to your posts url.
- GPT_USERNAME: Add gpt username for selenium login
- GPT_PASSWORD: Add gpt password for selenium login
Now you should create a CSV file in csvs/ with a list of your keywords the app will use. You can easily create a compatible CSV file by exporting a keywords list from ahrefs.com. Otherwise you can create a CSV by copying the probiotics.csv which is already in the directory. The CSV file should contain a column that is labeled Keyword (with capital K). No other columns are needed.
There are two arguments needed to start the app.
- keyword: this is the name of your CSV file with a list of keywords you would like to create articles for. Since the CSV file given with the repo is called probiotics.csv, to run the keywords in the file you would call "--keyword "probiotics"
- category: this is the wordpress category that will be used for the articles. If it is not already created in wordpress the category would be created. Example: --category "Health & Household"
Now run the app like this:
python3 gpt_keyword.py --keyword "probiotics" --category "Health & Household"
- The app will scrape the first page of organic google search results using serper.dev and playwright python.
- The app will summarize each result using Chat GPT and combine the summaries.
- The app will come up with a slug, tags, title and ~10 headings for the article based on the content of the article summaries.
- The app will search the tags on unsplash and get the first image for the featured image in your article.
- The app will make an intro paragraph based on the langchain RetrievalQA of the keyword based on the article summaries.
- The app will fill out the article using the headings and RetrievalQA of the headings given the article summaries. This way everything will fit in the context window of the chat gpt turbo or chat gpt turbo 16k while also getting relevant information for each section.
- Tags and category are dealt with using functions available in the app.
- The app will log the keywords which are used in the csvs/finished/ folder with the keyword as the csv name. Example: probiotics.csv
- Aug 23 11:38pm EST - Fixed bug in section schemas to parse out extra heading
- Aug 23, 7:00pm EST -- Updated section schemas to parse out extra heading.
- Aug 23, 4:43am EST -- Just updated section schemas to add better parsing.
- Aug 23, 2:47an EST -- Just updated intro schemas to add better parsing so the section will start on the right character.
- If anyone can change the repo to get more relevant unsplash photos for each post that would be much appreciated. The photos are generally not as related to the post as they should be.