YC Company Scraper is a Python script that fetches information about Y Combinator companies from specified batches, extracts GitHub repository links from their websites, and prints the data.
- Retrieve company information from various Y Combinator batches
- Scrape GitHub repository links from company websites
- Print the extracted data
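The link-scraping step can be sketched as follows. This is a minimal illustration, not the script's actual implementation: the class `GitHubLinkParser` and the function `extract_github_links` are hypothetical names, and the sketch assumes that a "GitHub repository link" is any anchor whose host is github.com. It uses only the standard library so it can be run without a network connection.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class GitHubLinkParser(HTMLParser):
    """Collects href values of <a> tags whose host is github.com (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                host = urlparse(value).netloc.lower()
                # Keep each GitHub URL once, in the order it first appears.
                if host in ("github.com", "www.github.com") and value not in self.links:
                    self.links.append(value)


def extract_github_links(html):
    """Return the GitHub links found in an HTML string."""
    parser = GitHubLinkParser()
    parser.feed(html)
    return parser.links


sample = (
    '<a href="https://github.com/example/repo">Code</a>'
    '<a href="https://example.com/about">About</a>'
    '<a href="https://github.com/example/repo">Code again</a>'
)
print(extract_github_links(sample))
```

In a real run, the HTML string would presumably come from something like `requests.get(website_url, timeout=10).text`, where `website_url` is the company's website.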
- Python 3.6 or higher
- requests library
- Clone this repository:
git clone [email protected]:dirkjbreeuwer/yc-scraper.git
cd yc-scraper
- Install the requests library:
pip install requests
- Open yc_company_scraper.py and modify the batches list to include the desired Y Combinator batch codes.
- Run the script:
python yc_company_scraper.py
- The script will print each company's name, website, YC URL, and any GitHub repository links found on its website.
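As a rough illustration of the usage steps above, the sketch below shows what editing the `batches` list and printing one company's record might look like. The batch codes, the dictionary keys, and the `format_company` helper are assumptions made for this example; the actual script may structure its data and output differently.

```python
# Illustrative batch codes only; replace with the batches you want to fetch.
batches = ["W21", "S21"]


def format_company(company):
    """Build the printed record for one company (hypothetical helper and keys)."""
    lines = [
        f"Name: {company['name']}",
        f"Website: {company['website']}",
        f"YC URL: {company['yc_url']}",
    ]
    github_links = company.get("github_links", [])
    if github_links:
        lines.append("GitHub: " + ", ".join(github_links))
    else:
        lines.append("GitHub: none found")
    return "\n".join(lines)


# Made-up record used only to show the output shape.
example = {
    "name": "Example Co",
    "website": "https://example.com",
    "yc_url": "https://www.ycombinator.com/companies/example-co",
    "github_links": ["https://github.com/example/repo"],
}
print(format_company(example))
```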
This script is for educational purposes only. Always respect the websites' terms of service and robots.txt when scraping data.