Pinned
-
Reinforcement-Calibration-SimCSE
Reinforcement Calibration SimCSE, combining contrastive learning, artificial potential fields, perceptual loss, and RLHF to achieve improved Semantic Textual Similarity (STS) embeddings. PyTorch-ba…
Python 10
-
event-timeline-generation-olympics
A toy system for generating event timelines from social media data, focusing specifically on Olympic Games medalist events.
Jupyter Notebook 6
-
byte_pair_encoding_BPE_subword_tokenization_implementation_python
Byte-Pair Encoding (BPE), a subword-based tokenization algorithm, implemented from scratch in Python.
Python 13
-
Logic-RL-Lite
Lightweight replication study of DeepSeek-R1-Zero. Explores pure RL without SFT as post-training for reasoning capability. No "Aha moment" and "Longer CoT ≠ Accuracy".
Python 1
148 contributions in the last year