audit scraping tutorial (and audit the HTML) #23
Comments
Hi Hannes, this is what I have included so far. I cannot add it here since it is a Jupyter notebook. I will send the right version via email (since images do not seem to work well in Colab), but I will also add a Google Colab link here: https://colab.research.google.com/drive/1F64Po-c3weJAm_ZrABAQRBJzXS-Qj5y4?usp=sharing I did not include the recently played or top 10 songs for users and artists, since I can only scrape the table as a whole, but I have code for that if we want to include it later. I can also remove things if some parts are already too extensive.
Hi @fleurlemire - please commit your work directly to our GitHub repository for this project. You can create a new folder (say: tutorials) in the root directory. Please let me know once it is done.
Hi @hannesdatta, when I try to commit, I get an error message saying "permission denied".
You should now have push access. Can you try again?
It is added! @hannesdatta
Hi Hannes, I added some extra information, including how to save the scraped data, and uploaded it.
Background:
We've built our site so others can learn how to scrape. But, we've never actually tried scraping it ourselves!
The purpose of this task is to build a "scraping tutorial" for the site, BUT ALSO revise our HTML templates to make the site "scraping-friendly".
We need to ensure that we cover a range of "identifiers" to get data from the site. This should be
Further, we need to ensure students can extract information both (1) from the text content of HTML elements and (2) from attribute values.
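To make the two extraction modes concrete, here is a minimal sketch using BeautifulSoup. The HTML snippet, the class names, the `id`, and the `data-plays` attribute are all hypothetical and only for illustration; the site's actual templates may look different. The helper name `extract_artist` is also made up for this example.

```python
from bs4 import BeautifulSoup

# Hypothetical markup; the real templates of the site may differ.
SAMPLE_HTML = """
<div id="artist-42" class="artist-card" data-plays="1200">
  <h2 class="artist-name">Example Artist</h2>
  <a class="artist-link" href="/artist/42">Profile</a>
</div>
"""


def extract_artist(html):
    """Pull one value from element text and two values from attributes."""
    soup = BeautifulSoup(html, "html.parser")

    # Locate the element by tag name and class.
    card = soup.find("div", class_="artist-card")

    # (1) Information stored in the TEXT of an element.
    name = card.find("h2", class_="artist-name").get_text(strip=True)

    # (2) Information stored in ATTRIBUTE VALUES.
    link = card.select_one("a.artist-link")["href"]  # CSS selector
    plays = int(card["data-plays"])                  # custom data attribute

    # The same element can also be found via its id.
    same_card = soup.find(id="artist-42")

    return {
        "name": name,
        "link": link,
        "plays": plays,
        "id": same_card["id"],
    }


if __name__ == "__main__":
    print(extract_artist(SAMPLE_HTML))
```

If the templates expose the same piece of data through several identifiers (tag, class, id), students can practice each lookup style on one element, as the sketch does with the `div`.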
Deliverable:
BeautifulSoup. As an example, see this tutorial.
Next steps: