Join our Meetup group for more events! https://www.meetup.com/data-umbrella
- Transcript: https://github.com/data-umbrella/event-transcripts/edit/main/2022/58-emily-diego-wikimedia.md
- Meetup Event: https://www.meetup.com/data-umbrella/events/286553513/
- Video: https://youtu.be/mrHp3wc_6DQ
- Slides: https://docs.google.com/presentation/d/1EWOIdlagmIJx1_xhomj_vQ_ZHcTYPiRSLL-4FpkNZ-Q/edit?usp=sharing
- Transcriber: ? [needs a transcriber]
- Wikimedia Research Team: https://research.wikimedia.org/ Research Programs: https://research.wikimedia.org/projects.html
- Wikimania 2022 event: https://meta.wikimedia.org/wiki/Wikimania_2022
- Events: https://research.wikimedia.org/events.html Wiki Workshop: https://wikiworkshop.org/2022/
- User Diego: https://meta.wikimedia.org/wiki/User:Diego_(WMF)
Contact us at: [email protected] or via the #wikimedia-research IRC channel
You can fill out our interest form and we can contact you if we have an opportunity that aligns with your background: https://docs.google.com/forms/d/e/1FAIpQLScOSgCqRZW6e7l260BOM9O6FfL onIWDd6WtFUPwBYjcrS_SCA/viewform
Please review our privacy statement prior to sharing your information: https://foundation.wikimedia.org/wiki/Research_Team_Interest_Survey_Privacy_Statement
Wikimedia Foundation Internship Programs
- Outreachy (https://www.mediawiki.org/wiki/Outreachy)
- Google Summer of Code (https://www.mediawiki.org/wiki/Google_Summer_of_Code)
Tools / Resources:
- Developer portal: https://developer.wikimedia.org/
- Wikimedia Research Gitlab: https://gitlab.wikimedia.org/repo/research
- Wikimedia Research Fund: https://meta.wikimedia.org/wiki/Grants:Programs/Wikimedia_Research_%26_T echnology_Fund
- Notebook with examples on using Wikimedia data: https://github.com/digitalTranshumant/Wiki-examples/blob/master/WikiMedia PublicTools.ipynb
During this session, Diego Saez-Trumper and Emily Lescak from the Wikimedia Foundation Research team will discuss the history and structure of the Wikimedia Foundation and how the data science community can contribute to Wikimedia projects.
Dr. Emily Lescak is the Senior Research Community Officer at The Wikimedia Foundation, where she focuses on supporting, growing, and diversifying the global community of Wikimedia researchers. She began her data science career as a fisheries researcher in both the academic and government sectors. She transitioned full-time to non-profit program and community management work two years ago when she developed and managed Code for Science & Society's Event Fund, which provides financial and programmatic support to organizers of open data science events.
Diego Sáez Trumper is Chilean computer scientist, currently working as a Senior Research Scientist at the Wikimedia Foundation and Visiting Research Fellow at University Pompeu Fabra, where he obtained his PhD in 2013 under the supervision of Ricardo Baeza-Yates. Before, Diego worked as researcher and data scientist at NTENT, Eurecat, QCRI and Yahoo Labs. He has also been a visitor and collaborator of several universities such as UMFG (Brazil), Cambridge (UK), and UCU (Ukraine). His research focuses on the usage of data science to understand and deal with the diffusion of (dis)information in online platforms.
-
LinkedIn: https://www.linkedin.com/in/emily-lescak-b4652446/
-
Twitter: https://twitter.com/elescak
-
LinkedIn: https://es.linkedin.com/in/diego-s%C3%A1ez-trumper-5b24a2aa
-
Twitter: https://twitter.com/e__migrante
00:00 Beryl introduces Data Umbrella
05:40 Beryl introduces Emily
06:35 Beryl introduces Diego
08:20 Emily and Diego‘s introduction
09:30 Outline and agenda
09:45 The Wikimedia Foundation
12:15 Research priorities
14:42 Using Wikipedia resources
20:45 Article quality scores
23:30 Reminder on article versions
25:30 How to access content
29:58 Media Wiki Utilities
30:50 Quarry / SQL Replicas
33:38 Wikimedia Statistics
35:32 Page Views
40:02 Click Dataset – visitor information
42:20 Wikimedia Toolforge
43:45 PAWS: a Web Shell customized to interact with Wikimedia
45:10 How to contribute to Wikimedia (Emily)
49:15 How to stay in touch
50:45 Start of Q&A Session