Skip to content

saneshashank/Reference-Links

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 

Repository files navigation

Reference-Links

This repository contains a curated list of articles for ML/AI/NLP. I have browsed many of these articles while researching on problems that I have faced while applying ML to industry problems and could be useful to other practitioners as well. Although I have tried to arrange the topics at high level, there is not strict sequence to the references present.

General trivia about basic DS libraries


Inferential Statistics

General

Frequentist AB testing:

ANOVA tests:


Machine Learning

General

Feature Scaling

Feature reduction/Feature Selection:

dummy vars

Pipelines in sklearn

Boosting:

Metrics for ML model evaluation

Dimensionality Reduction

PCA

t-SNE

ICA

Model Stacking

Distance Metrics

Clustering

Handling imbalanced data set:

Handling Skewed data:

Multi-label Classification

Probability Callibration:

Sparse Matrix

Design of Experiment

Model Explainibility/Interpretable ML/ Fairness in AI/ Responsible AI

Anomaly Detection

Semi Supervised Learning & Active Learning

Model Fracking and Concept Drift:

Time Series

Open Datasets

XGBoost Installation:

  • check you python version - by opening CMD and typing python -> ENTER
  • Go to this link and search on XGBoost: https://www.lfd.uci.edu/~gohlke/pythonlibs/
  • download the installable based on python version + Windows 32 or 64 bit, for example download xgboost-0.71-cp36-cp36m-win_amd64.whl for python version 3.6 and 64 bit machine.
  • open cmd in downloaded location and run the following command: pip install xgboost-0.71-cp36-cp36m-win_amd64.whl

Deep Learning:

General

CNN and Image Processing:

1D - CNNs

Keras Embedding Layer

Keras generators

Saving Keras Models

Clustering using DL

Large Model Support usage in keras

Image Captioning

Image Segmentation:


Natural Language Processing (NLP) and Natural Language Understanding (NLU)

General

Text Classification using Deep Learning:

Spacy resources

Topic Modelling:

Top2Vec

Text Summarization:

keyword-phrase extraction/keyphrase extraction/phrase extraction

Gensim

Natural Language Understanding

NLG

EVT

Doc2Vec

Information Retrieval, text search & semantic search:

Longform Question Answering

NLP spell correction:

Transfer learning in NLP:

Latest Language Models usage & applications

Text Data Augmentation


Advanced Topics

Knowledge Graphs

Deep Learning and Graphs:

Geometric Deep Learning & Graph learning.

Text GCN:

RBF Neural Networks

Graph Neural Networks (GNN) good resources

Probabilistic programming in tensorflow

Bayesian Optimization

Variational Autoencoder

Reinforcement Learning

PGM/ Causal Inference in ML

Information Theory of Deep Learning

Bayesian Deep Learning

Kalman Filters

Geometric deep learning

Neuraxle

Math & Deep learning

Multitask Learning (MTL)

Weak Supervison & Semi-Supervised Learning

AutoML

Future research topics


ML Engineering

Docker

Advanced/Intermediate Python

OOPS & others

writing better code for DS:

Generators

Data products

Architecture considerations

Streamlit

programming environments

Additional steps for nbdev on Windows 10 (so that make docs_serve command runs and documentation is visible locally)

You might also have to add Git (C:\Program Files\Git\bin) to Path in windows system variables.

  • Installing Ruby & Jeykyll:
    • Install Ruby: https://rubyinstaller.org/
    • Install the jekyll and bundler gems:gem install jekyll bundler
    • go into the docs folder S:\deck_of_cards\docs (in this folder you will find the Gem and run the following command: bundle install`

That's it this will complete setup, now make docs_serve can be run

Dashboarding in jupyter notebook

The missing CS semester

writing your own blog:

Visual C++ build tools:

Chatbots

Scaling ML projects:

Spark

Scaling Pandas: Comparing Dask, Ray, Modin, Vaex, and RAPIDS

https://www.datarevenue.com/en-blog/pandas-vs-dask-vs-vaex-vs-modin-vs-rapids-vs-ray

Modin

Dask

Ray

Prefect

Project Structure

MLOPs

Machine Learning Design Patterns


References (Usecase specific applications):

Some good sites to follow

About

Curated list of articles for ML/AI/NLP

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published