Skip to content
View Rishabbh-Sahu's full-sized avatar

Block or report Rishabbh-Sahu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Rishabbh-Sahu/README.md

Hi πŸ‘‹, I'm Rishabbh

Data scientist and consultant, having experience across multiple industries like retail, manufacturing, FMCG, banking(fintech), insurance, consumer business etc.

I'm currently:
  • πŸ”­ Working on deep learning projects such as transfer learning, pre-trained models (transformers), incremental models, layer pruning, quantization & distillation, language models to solve NLP downstream tasks like document parsing pipeline (banking sector), summarization, Q&A, NLU/NLG, context based auto-completion, RestAPI's flask-endpoint, deployment & productionization, docker images & containers, kubernetes, GCR-GCP, CI/CD pipeline, gitHub hooks/actions(pre/post commit, workflows), DVC-pipeline, mongoDB, google-OCR, label studio, Transformer's model interpretability etc.

  • 🌱 Learning autoencoders, self-supervised learning, optimization, time series analysis using deep learning (deepstates model-gluonTS), linear programming (optimization), anomaly detection, feature learning, data comprssion techniques(SVD, matrix factorization), MLOps (ML pipeline), model interpretability (Explicable-AI), ablation study, TextRank - grpah representation of text with PageRank

  • πŸ‘€ Interested in doing research work/consulting assignments by sharing, learning and exploring to/from open source communities. Motivated to create numerous projects in the field of AI/ML/DL with the focus to deploy into production.

  • πŸ‘― Looking forward to collaborate on ML/DL projects and Kaggle competitions


About my work:
  • πŸ‘¨β€πŸ’» All of my projects are available at https://github.com/Rishabbh-Sahu

  • πŸ’¬ Ask me about NLU/NLP, intent/sequence/text/email classification, NER (named entity recognition), sentence/document/semantic similarity, information retrieval using tfidf/context-based, time series analysis (forecasting), deep learning, data augmentation, dialog system (voice models), PLM's (pre-trained language models), model ensembling/stacking, feature selection methods, segmentation, tokenization (text), optimization, recommendation engine, customer-360 analysis, statistics, retail analytics, supply chain analytics, dimensionality reduction, regularization techniques, bias & varaince, DOE-design of experiments (ANOVA,T/F/Chi^2/Z-test), KS-test, sampling methods, A/B testing, crowd-sourcing (Toloka, Mtark etc.), Big-query, SQL

  • πŸ“« You can reach me on www.linkedin.com/in/rishabbh-sahu-pmp



Languages and Tools:

python tensorflow R aws azure git linux gcp mysql sqlite hadoop hive jenkins mssql oracle postman c cplusplus



rishabbh-sahu

rishabbh-sahu's GitHub stats

Pinned Loading

  1. intent_and_slot_classification intent_and_slot_classification Public

    One of the main NLU tasks is to understand the intents (sequence classification) and slots (entities within the sequence). This repo help classify both together using Joint Model (multitask model).…

    Python 15 1

  2. information_retrieval information_retrieval Public

    Given a document, identifying the closest documents within the list of documents using tf-idf matrix and cosine similarity

    Python 4 2

  3. semantic_lookalike_transformers semantic_lookalike_transformers Public

    Finding look alike sentences by leveraging the concept of semantic similarities pre-learned by transformer models while pre-training task. I've used cosine similarity as an angular distance matrix …

    Python 3 4

  4. linear_programming linear_programming Public

    Solving linear programming using python optimizer interface. This repo allows you to add multiple decision vars and constraints etc. in a very easy way.

    Jupyter Notebook 2

  5. ignore_email_address_classifier ignore_email_address_classifier Public

    Important vs Ignore email classifier based on incoming email addresses. Bert tokenizer is used as a tokenization method and CNN network as the model. Framework used - Tensorflow 2.4

    Python 1

  6. learning_pathways learning_pathways Public

    A repository consists of various links which helps understanding a concepts, algorithm etc.