Skip to content

Issues: instructlab/sdg

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Add precomputed dataset to skills data generation enhancement New feature or request
#171 opened Jul 19, 2024 by bbrowning updated Nov 22, 2024
Set a default seed value for gen_kwargs enhancement New feature or request
#169 opened Jul 18, 2024 by russellb updated Nov 22, 2024
Incorrect identify system prompt while generate SDG
#406 opened Nov 22, 2024 by alfandindarahmawan updated Nov 22, 2024
Add proper typehints
#379 opened Nov 14, 2024 by RobotSail updated Nov 25, 2024
SDG agnetic pipeline documentation
#307 opened Oct 14, 2024 by relyt0925 updated Nov 26, 2024
Add LeafNode class good first issue Good for newcomers
#292 opened Sep 30, 2024 by khaledsulayman updated Dec 5, 2024
Checkpoint files make iterating on a taxonomy awkward UX Affects the User Experience
#245 opened Aug 16, 2024 by bbrowning updated Dec 11, 2024
Support more than 3 qna per context chunk enhancement New feature or request
#232 opened Jul 30, 2024 by markmc updated Dec 11, 2024
Include precomputed dataset and datamixing recipes enhancement New feature or request
#234 opened Aug 5, 2024 by aakankshaduggal updated Dec 11, 2024
Simplify base_document column usage with auxiliary instructions in pipeline config enhancement New feature or request
#228 opened Jul 29, 2024 by bbrowning updated Dec 11, 2024
checkpointing: consider allowing users to specify save frequency enhancement New feature or request
#225 opened Jul 28, 2024 by markmc updated Dec 11, 2024
Make generate_data(batch_size=None) default to a batch size of 8 refactor Same results, different method
#224 opened Jul 27, 2024 by markmc updated Dec 11, 2024
INFO logging seems more like DEBUG refactor Same results, different method
#223 opened Jul 27, 2024 by danmcp updated Dec 12, 2024
replace v1/completions with v1/chat/completions
#290 opened Sep 26, 2024 by aseelert updated Jan 7, 2025
Make use of structured output with vLLM enhancement New feature or request
#481 opened Jan 16, 2025 by russellb updated Jan 21, 2025
Remove chunk_word_count and chunk approximations enhancement New feature or request
#429 opened Dec 5, 2024 by bbrowning updated Jan 21, 2025
Control maximum parallel requests to inference server
#424 opened Dec 2, 2024 by bbrowning updated Jan 21, 2025
Run test scripts in CI CI/CD Affects CI/CD configuration testing Relates to testing
#56 opened Jun 30, 2024 by russellb updated Jan 21, 2025
Consider moving to chat/completions API enhancement New feature or request
#87 opened Jul 5, 2024 by markmc updated Jan 21, 2025
Remove system prompt from data generation refactor Same results, different method
#96 opened Jul 8, 2024 by oindrillac updated Jan 21, 2025
Support --pipeline=/path/to/pipeline.yaml with a "direct router" enhancement New feature or request
#134 opened Jul 15, 2024 by markmc updated Jan 21, 2025
Document and add examples for each Block type documentation Improvements or additions to documentation
#415 opened Nov 27, 2024 by bbrowning updated Jan 23, 2025 0.8.0
ProTip! Exclude everything labeled bug with -label:bug.