Sign Language Recognition using Machine and Deep Learning
- Classifies the American Sign Language to english letters in real time by taking video input using webcam
- I3D Transfer Learning : We worked on classifying most common words in English from sign language. For this, we used the WLASL dataset and fine tuned the I3D model which was pretrained on the Kinetics-400 dataset
- CNN + RNN with self made dataset : We Created our own small dataset for 10 common words. Each word had 30 videos of 30 frames. Here we extracted the landmarks of pose, left hand and right hand using mediapipe holistic model. And used those landmarks of the frames for training the RNN model. This Model works accurately for the person who trained but the accuracy drops for the other persons
- Transfer Learning with Resnet & Mobilnet : Trained on ASL dataset (consisting of sign languages of A-Z letters and some special characters like SPACE, DELETE and NOTHING). Predicts accurately on the training, validation and testing dataset but the accuracy drops in production
- MediaPipe Feature Extraction + Machine Learning Algorithms : Using MediaPipe's Hand Model , features are extracted and engineered. Those are fed to machine learning model and are used to predict the labels
-
- WLASL is the largest video dataset for Word-Level American Sign Language (ASL) recognition
- This dataset consists of around 20K videos for sign language representation of around 2000 commonly used words
-
American Sign Language Dataset for letters
- Image data set for alphabets in the American Sign Language
- Contains 87000 images of 200 X 200 pixels, divided into 29 classes (A - Z, SPACE, DELETE and NOTHING)
- The test data set contains only 29 images, 1 image per labels.
-
American Sign Language Dataset for letters
- RGB image dataset of American sign language alphabets.
- This dataset contains 166k images of 200 X 200 pixels, divided into 28 classes (A - Z, SPACE and NOTHING)
- The test data set contains 112 images , 4 images per labels.
- Action Recognition
- Hand Feature Extraction on
WLASL
Dataset & feeding the output to RNN for Word Sign Classificaion
- Hand Feature Extraction on
- ASL Alphabet detection using Deep Learning models
- ASL Alphabet detection
- Sign Language detection
- Sign Language detection - 2
- ASL model testing
- Clone the repository using the
git clone
$ git clone https://github.com/utsavk28/Nerds.git
- Create a virtual environment
$ virtualenv venv
$ source venv/bin/activate
- Install the required packages
$ pip install -r requirements.txt
- Run the app
$ python main.py