- Python 3.6.7
- Selenium 3.141.0
- BeautifulSoup
Just run the script in terminal using python3 script.py
- Input.csv Contains input email ids to be searched on thumbtube.
- Output.csv contains the scraped data which includes first name, last name,linkedin url designation, company, location.
- Script.py Python script for scraping data.
- broweser_init()= initiates a headless firefox window.
- site_login()= logs in to test account.
- profile_search()= searches for profile on thumbtube.
- get_deatils()= extracts data from linkedin profile.
Currently the extraction is limited to 50 accounts daily because of lack of API access by Linkedin, contact me if you know how to bypass this limit! :)