## How to set up your own harvester

The harvester works best with about 20 search terms (called keywords here) organized under 4-6 topics.

---

### 1) Register at [github.com](https://github.com/signup)

* Some instructions can be found [here](https://docs.github.com/en/get-started/signing-up-for-github/signing-up-for-a-new-github-account)
* Register at [github.com](https://github.com/signup) with email, password, and username (mine is kmicha)
* Make sure to register for the free version; simply don't enter any credit card information
* Solve the captcha, if asked
* Check your email and enter the verification code
* Sign in

---

### 2) Create an organization at github.com

* Sign in and go to your organizations page.
  *(screenshot: image of organisations)*
* Create a new organization with a unique name, e.g. choose your username and append '-news', like 'kmicha-news'
  *(screenshot: image of organizations)*
* Choose the free plan
* Enter the same email as for your account
* Choose 'My personal account'
* Verify the captcha
* No add-ons
* Accept the terms
* Add me (KMicha) to the organization
* Change the member permission from 'read' to 'admin'

---

### 3) Change the organization's settings for actions at github.com

* Go to settings
  *(screenshot: image of organisations)*
* Under 'Code, planning, and automation', open the 'Actions' section and select 'General' - press SAVE
  *(screenshot: image of organisations)*
* pressed *SAVE* ?
* Choose 'Allow all actions and reusable workflows' at the very top - press SAVE
  *(screenshot: image of organisations)*
* pressed *SAVE* ?
* Under 'Workflow permissions' (at the very bottom), choose 'Read and write permissions' - press SAVE
  *(screenshot: image of organisations)*
* pressed *SAVE* ?

---

### 4) Fork the [newsWhisperer/winterIsComing](https://github.com/newsWhisperer/winterIsComing) repository to your new organization

* Go to [newsWhisperer/winterIsComing](https://github.com/newsWhisperer/winterIsComing), select 'Fork' and 'Create a new fork'
  *(screenshot: image of fork)*
* Choose your new organization (the one ending in '-news') as owner (NOT THE USER NAME!)
  *(screenshot: owner of fork)*
* Press the green button

---

### 5) Add me to the repository team with admin access

* Go to the settings page and select 'Collaborators and teams'
  *(screenshot: add people)*
* Add me (KMicha) with admin access
  *(screenshot: add kmicha)*

---

### 6) Adapt the keywords and topics

The harvester works best with about 20 search terms (called keywords here) organized under 4-6 topics.

* Go to your code page and select the keywords.csv file
  *(screenshot: edit keywords.csv)*
* Edit the keywords file
* Make sure to keep the first (header) line unchanged
* keyword: Your search term in single quotes. It can combine several words, like 'this that'. Keywords should be unique.
* language: Choose your language (must be supported by newsapi), e.g. de or en
* topic: Several keywords are combined into a common topic. No quotes. No spaces.
* topicColor: The hex code of the topic. Same topic, same color code!
* keywordColors: Each keyword can have a separate color hex code.
  It might be similar to its related topic color.
* limitPages: Will be adapted dynamically. Set it to 1 initially.
* ratioNew: Will also be adapted. 1 is also fine.

Color pickers can be googled. Or use one of these: [redketchup](https://redketchup.io/color-picker), [w3schools](https://www.w3schools.com/colors/colors_picker.asp)
*(screenshot: edit keywords.csv)*

Attention: You can adapt the keywords later, but for now you should not remove any once you've started harvesting!

* Submit the changed file. The description can be empty. Commit directly to the main branch. Press the green button.
  *(screenshot: edit keywords.csv)*

---

### 7) Remove existing news_20yy_mm.csv files

Inside your repository, go to Code, open the csv folder, then select each news_2022_mm.csv file (one after the other) and delete it. After each deletion, a commit must be made.

*(screenshots: goto csv, select csv, delete csv, submit csv)*

---

### 8) Run Action : check:0.0

* Go to the Actions tab
  *(screenshot: image of fork)*
* Select the check:0.0 workflow in the left panel
* Run the workflow on the right side
* Wait about 2-3 minutes for completion

---

### 9) Check CHECK.md

* Go to the Code tab
  *(screenshot: image of fork)*
* Click on the CHECK.md file
* Follow the instructions for red errors
* Repeat steps 8) & 9) until all fields are green

---

### 10) Enable other actions

* Go to the Actions tab
  *(screenshot: image of fork)*
* Enable all actions: bayes/diagrams/entities/harvest
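
---

As an optional aid for step 6, the rules for keywords.csv (unique keywords, topics without spaces or quotes, one color per topic) can be checked locally before committing. The following is only a minimal sketch, not part of the harvester: the function name `check_keywords` is hypothetical, the column names are taken from the list in step 6 and may differ from the real header line, and the `#rrggbb` color format is an assumption.

```python
import csv
import io
import re

# Column names as listed in step 6 (assumption: the real header may differ).
EXPECTED_COLUMNS = ["keyword", "language", "topic", "topicColor",
                    "keywordColors", "limitPages", "ratioNew"]
# Assumption: colors are written as '#rrggbb'.
HEX_COLOR = re.compile(r"^#[0-9a-fA-F]{6}$")


def check_keywords(csv_text):
    """Return a list of human-readable problems found in the CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        return ["missing columns: " + ", ".join(missing)]

    problems = []
    seen_keywords = set()
    topic_colors = {}
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        # Short rows yield None for missing cells, so fall back to "".
        cell = {name: (row.get(name) or "").strip() for name in EXPECTED_COLUMNS}

        if cell["keyword"] in seen_keywords:
            problems.append(f"line {line_no}: duplicate keyword {cell['keyword']}")
        seen_keywords.add(cell["keyword"])

        if any(ch in cell["topic"] for ch in (" ", "'", '"')):
            problems.append(f"line {line_no}: topic must have no spaces or quotes")

        for column in ("topicColor", "keywordColors"):
            if not HEX_COLOR.match(cell[column]):
                problems.append(f"line {line_no}: {column} is not a hex color")

        # Same topic, same color: remember the first color seen per topic.
        color = cell["topicColor"].lower()
        if topic_colors.setdefault(cell["topic"], color) != color:
            problems.append(
                f"line {line_no}: topic {cell['topic']} has two different colors")
    return problems
```

To try it on a local copy of the file, something like `print(check_keywords(open("keywords.csv", encoding="utf-8").read()))` should work, assuming the file sits in the current directory. An empty list means none of these particular checks failed; the CHECK.md report from step 9 remains the authoritative check.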