Skip to content

Latest commit

 

History

History
35 lines (20 loc) · 1.07 KB

README.md

File metadata and controls

35 lines (20 loc) · 1.07 KB

Logo

Data Science for Beginners

A collection of Jupyter Notebooks, HTML and JS code for Data Scientists.

Comments on the single scripts can be found on my Medium blog as well as on my Web site.

Programming Languages

  • Python 3.X
  • HTML5
  • Javascript, with a focus on D3.JS
  • CSS

Other interesting tutorials can be found on my Observable Profile.

Authors

Documentation

The Project is organised in different folders, one covered topic:

  • Data Collection - data extraction from HTML, Twitter, PDF ...
  • Preprocessing - missing data, duplicates, normalisation, binning ...
  • Data Analysis - full workflow in scikit-learn and PyCaret, as well as overfitting, Auto ML, ...
  • Text Analysis - sentiment analysis, ...
  • Data Visualisation - examples in Altair, Plotly, D3.js, ...
  • Data Narrative - how to improve data visualisations.