Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Events scraper #36

Closed
qasim opened this issue Apr 12, 2016 · 5 comments
Closed

Events scraper #36

qasim opened this issue Apr 12, 2016 · 5 comments

Comments

@qasim
Copy link
Member

qasim commented Apr 12, 2016

https://www.events.utoronto.ca/

@qasim
Copy link
Member Author

qasim commented Apr 12, 2016

Proposed schema:

{
  "id": String,
  "title": String,
  "date": {
    "start": String,
    "end": String,
  },
  "url": String,
  "description": String,
  "campus": String,
  "address": String,
  "audience": [String]
}

@g3wanghc
Copy link
Member

@qasim I'm down to work it. :V

@qasim
Copy link
Member Author

qasim commented Apr 15, 2016

@g3wanghc Awesome. Here's a little tweaked schema that I had in mind, keeping with conventions from other scrapers:

{
  "id": String,
  "title": String,
  "date": String,
  "start_time": String,
  "end_time": String
  "url": String,
  "description": String,
  "campus": String,
  "address": String,
  "audience": [String]
}

date would be ISO 8601 format like 2016-04-15.
start_time and end_time would also be ISO 8601 like 2016-04-15T12:00:00-04:00 (standardized for eastern timezone).
url would just be the URL of the event posting, i.e. https://www.events.utoronto.ca/index.php?action=singleView&eventid=12052.

My thinking is each scraper run should go through all the pages of events (so every event currently listed) and then open each link inside to grab the more detailed information to complete the schema.

Let me know what you think!

@g3wanghc
Copy link
Member

Sure, looks like fun. 👍

@g3wanghc
Copy link
Member

@qasim Do we care about Admission Price, Contact Info, Website and Event Sponsor?

@qasim qasim mentioned this issue Apr 19, 2016
qasim added a commit that referenced this issue Apr 19, 2016
@qasim qasim closed this as completed Apr 19, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants