Installation

pip install -r requirements.txt

Models and Datasets

Datasets are from alpaca_farm
Models: ChatGLM2, ChatGLM3 and Phi-2(Microsoft).

Probe

main.py

Call probe_dist() to get probabilities and ranks for the given QA-pairs. Results are saved in qa_status.

This project is tested on alpaca-preference dataset, so many methods are designed according to the special format of this dataset.

I plan to adjust the methods in the DistributionProbe class to abstract the processing of datasets, support streaming output, and provide overall probability distribution for individual tokens.

Visualize

analyze.py

Different color represents tokens with different probabilities and ranks predicted by certain model. In the default setting, darker tokens have lower ranking and vice versa.

White, green, yellow and red represent increasingly lower predicted probabilities in turn. class controller reads the file in qa_status and displays results in the terminal.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
img		img
qa_status		qa_status
src		src
README.md		README.md
analyze.py		analyze.py
main.py		main.py
paint.py		paint.py
requirements.txt		requirements.txt
train_rm.py		train_rm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Models and Datasets

Probe

main.py

Visualize

analyze.py

About

Releases

Packages

Languages

lsjlsj35/LLM-Distribution-Probe

Folders and files

Latest commit

History

Repository files navigation

Installation

Models and Datasets

Probe

main.py

Visualize

analyze.py

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages