Skip to content

Commit

Permalink
Apply automatic changes
Browse files Browse the repository at this point in the history
  • Loading branch information
mdingemanse authored and github-actions[bot] committed May 8, 2024
1 parent 2a4ca7d commit e7800cf
Show file tree
Hide file tree
Showing 3 changed files with 5 additions and 5 deletions.
2 changes: 1 addition & 1 deletion docs/df.csv
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ https://www.microsoft.com/en-us/research/project/orca/,"This file applies to Orc
https://huggingface.co/CohereForAI/c4ai-command-r-v01,,,Aya Collection,CC-BY-NC and C4AI acceptable use policy,Cohere AI,https://cohere.com,,closed,,"No codebase available to study or adjust model architecture, training, or inner workings.",closed,https://docs.cohere.com/docs/data-statement,"No documentation, listing or audit of pre-training data available. Cohere itself identifies it as coheretext-filtered and gives the size as 200Gb.",closed,,No checkpoint or model prior to SFT and instruction-tuning made available,open,https://huggingface.co/collections/CohereForAI/aya-datasets-660415741bd4852f01c81c77,Aya Collection (Aya Open Science initiative) is a multilingual collection of 513 million instances of promts and completions including instruction-style templates.,open,https://huggingface.co/CohereForAI/c4ai-command-r-v01/tree/main,Fine-tuned model weights made available for download,partial,https://docs.cohere.com/docs/c4ai-acceptable-use-policy,Licensed under CC-BY-NC and requires agreeing to C4AI acceptable use policy,closed,,"No source code available, so no documentation of code.",closed,,Architecture only sparsely documented.,closed,,No preprint appears to be made available at this time.,closed,,No paper known to document the Cohere Command R+ model or architecture.,partial,https://huggingface.co/CohereForAI/c4ai-command-r-v01-4bit,"Model card on HF document some aspects but provides no data on training data, instruction-tuning methods",closed,,Datasheet not available.,closed,,No separate package available.,closed,,API access available only when signing up.,/projects/command-r.yaml,3.0
https://ai.google.dev/gemma/docs,Model weights and developer tools,Gemma,Unspecified,,Google DeepMind,https://ai.google.dev/gemma/docs,,partial,https://github.com/google-deepmind/gemma,No pre-training or instructing-tuning code made available. Some developer tools available.,closed,,No details provided on pre-training data.,partial,https://www.kaggle.com/models/keras/gemma/frameworks/keras/variations/gemma_7b_en,"Base model weights shared via Kaggle, requires privacy-defying access request.",closed,,"Documentation says 'These versions of the model are trained with human language interactions and can respond to conversational input, similar to a chat bot.' ",partial,,"Instruction-tuned model weights shared via Kaggle, requires privacy-defying access request",closed,https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/335?pli=1,"Bespoke Gemma Community License Agreement and restrictive Terms of Use, neither open in the sense of OSI. Only Inference code shared under Apache 2.0.",closed,,No pretraining or finetuning code found. No code documentation except for deployment of the open weights.,partial,https://www.kaggle.com/models/google/gemma,Architecture described in very general terms in model card,closed,,No preprint found,closed,,No paper found,open,https://www.kaggle.com/models/google/gemma,Model card on Kaggle provides some detail on model internals and evaluation,closed,,"No datasheet found, pre-training and instruction-tuning data nowhere specified.",closed,,"No package provided, access is gated through Kaggle, Vertex Model Garden, Google Cloud",closed,,"No API provided, access gated through Kaggle, Vertex Model Garden, Google Cloud",/projects/gemma-instruct.yaml,3.0
https://ai.meta.com/resources/models-and-libraries/llama/,,LLaMA2,"Meta, StackExchange, Anthropic",Unclear,Facebook Research,https://github.com/facebookresearch,,closed,https://github.com/facebookresearch/llama/tree/main,"Repository only offers 'a minimal example to load Llama 2 models and run inference'; no training, fine-tuning, evaluation code made available",closed,,"Data nowhere disclosed or documented, and described only in the vaguest terms in a corporate preprint released by Meta",partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,Download only after requesting access; requires signing a consent form,closed,,RLHF data including 1 million Meta-specific tuning prompts not made available (even as it incorporates some open RLHF datasets),partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,Download only after requesting access; requires signing a consent form,closed,https://github.com/facebookresearch/llama/blob/main/LICENSE,"Usage requires signing Meta's bespoke 'community license', not an OSI recognised open license",closed,,Code only covers minimal examples; no documentation available.,partial,https://ai.meta.com/research/publications/llama-2-open-foundation-and-fine-tuned-chat-models/,"Architecture sketched in preprint, though many details missing.",partial,https://ai.meta.com/research/publications/llama-2-open-foundation-and-fine-tuned-chat-models/,"Corporate preprint quite some detail on pretraining, RLHF, and safety measures but none on datasets.",closed,,No peer-reviewed paper found,partial,https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md,"There is a model card, but it provides the absolute minimum of detail, and none whatsoever on training data.",closed,,Datasheet not provided.,closed,,Package not provided,partial,,API only available behind a privacy-defying signup form,/projects/llama-2-chat.yaml,3.0
https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat,Comes in 8B and 16B versions,Unknown,Unknown,Apache 2.0 and bespoke community license,Nanbeige LLM lab,https://huggingface.co/Nanbeige,,open,https://github.com/Nanbeige/Nanbeige,"github repo contains sparse but clear code for training, tuning, and inference",closed,,No information on pre-training datasets except a claim of 4.5T tokens,closed,,Base model not shared,closed,,No information on finetuning and DPO datasets,open,https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat/tree/main,Model weights for finetuned model shared,partial,,Apache 2.0 but commercial use requires signup and an additional community license,closed,,No documentation of the codebase,closed,,Architecture not clearly specified,closed,,No preprint found,closed,,No paper found,closed,,No model card found,closed,,No datasheet found,closed,,No package found,partial,https://huggingface.co/spaces/Nanbeige/Nanbeige-Plus-Chat-v0.1,"No API, but HuggingFace space available",/projects/nanbeige-chat.yaml,3.0
https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat,Comes in 8B and 16B versions,Unknown,Unknown,Apache 2.0 and bespoke community license,Nanbeige LLM lab,https://huggingface.co/Nanbeige,,open,https://github.com/Nanbeige/Nanbeige,"github repo contains sparse but clear code for training, tuning, and inference",closed,,No information on pre-training datasets except a claim of 4.5T tokens. Request for information on HF community was closed without comment.,closed,,Base model not shared,closed,https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat/discussions/2#6621e15a4d17641cf788cbd5,"No information on finetuning and DPO datasets. Some information provided on request (see link), but official documentation not updated.",open,https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat/tree/main,Model weights for finetuned model shared,partial,,Apache 2.0 but commercial use requires signup and an additional community license,closed,,No documentation of the codebase,closed,,Architecture not clearly specified,closed,,No preprint found,closed,,No paper found,closed,,No model card found,closed,,No datasheet found,closed,,No package found,partial,https://huggingface.co/spaces/Nanbeige/Nanbeige-Plus-Chat-v0.1,"No API, but HuggingFace space available",/projects/nanbeige-chat.yaml,3.0
https://huggingface.co/collections/meta-llama/meta-llama-3-66214712577ca38149ebb2b6,,Meta Llama 3,"Meta, undocumented",Meta Llama 3 Community License,Facebook Research,https://github.com/facebookresearch,,closed,https://github.com/meta-llama/llama3,Repository only offers minimal code,closed,,"Data nowhere disclosed or documented, and described only in the vaguest terms in a release blog post",partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,Download only after requesting access; requires signing a consent form,closed,,No information available on instruction-tuning.,partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,Download only after requesting access; requires signing a consent form,closed,https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct/tree/main,"Even inspecting the model requires signing Meta Llama 3's bespoke 'community license', not an OSI recognised open license",closed,,Code only covers minimal examples; no documentation available.,partial,https://ai.meta.com/research/publications/llama-2-open-foundation-and-fine-tuned-chat-models/,Architecture sketched in glossy blog post.,closed,,No preprint or any other scientific documentation available.,closed,,No peer-reviewed paper available.,partial,https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md,"There is a model card, but it provides the absolute minimum of detail, and none whatsoever on training data.",closed,,Datasheet not provided.,closed,,Package not provided,partial,,API only available behind a privacy-defying signup form,/projects/llama-3-instruct.yaml,2.5
https://huggingface.co/upstage/SOLAR-0-70b-16bit,HuggingFace profile says 'Solar is a great example of the progress enabled by open source.',LLaMA2,"Orca-style, Alpaca-style","Meta Community license, CC-BY-NC",Upstage AI,https://en.upstage.ai/,Korean venture,closed,,No code repository found,closed,,"Data nowhere disclosed or documented, and described only in the vaguest terms in a corporate preprint released by Meta",partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,Download only after requesting access; requires signing a consent form,closed,,"No RLHF datasets specified or shared, docs say 'Orca-style dataset, Alpaca-style dataset'",partial,https://huggingface.co/upstage/SOLAR-0-70b-16bit/tree/main,Finetuned checkpoints only shared through CC-BY-NC,closed,https://huggingface.co/upstage/SOLAR-0-70b-16bit#model-details,"Meta Community License for base model, and CC-BY-NC 4.0 for fine-tuned model weights",closed,,HuggingFace code only comprises configuration json; no documentation available.,closed,https://huggingface.co/upstage/SOLAR-0-70b-16bit,"Precise architecture, training, fine-tuning procedures not given.",closed,,No preprint or any form of scientific docuentation found.,closed,,No peer-reviewed paper found,partial,https://huggingface.co/upstage/SOLAR-0-70b-16bit,"HuggingFace model card used mostly as advertising, omits many details on training, fine-tuning, evaluation.",closed,,Datasheet not provided.,closed,,Package not provided,partial,,API only available by signing up for 'private LLM' service,/projects/solar-70B.yaml,2.0
https://huggingface.co/Xwin-LM/Xwin-LM-7B-V0.1,Xwin-LM aims to develop and open-source alignment technologies for large language models,LLaMA2,unknown,Llama 2 license,Xwin-LM,https://github.com/Xwin-LM,Xwin-LM aims to develop and open-source alignment technologies for large language models,closed,https://huggingface.co/Xwin-LM/Xwin-LM-7B-V0.1,"HuggingFace page notes 'to do "":"" Release the source code'",closed,,"Data nowhere disclosed or documented, and described only in the vaguest terms in a corporate preprint released by Meta",partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,Download only after requesting access; requires signing a consent form,closed,,"RLHF data for Llama includes 1 million Meta-specific tuning prompts not made available, no other details known about RLHF and alignment tuning added by Xwin-LM",closed,https://huggingface.co/Xwin-LM/Xwin-LM-70B-V0.1/tree/main,Downloadable model presumably includes RLHF tuning but no documentation available,closed,https://github.com/facebookresearch/llama/blob/main/LICENSE,"Usage requires signing Meta's bespoke 'community license', not an OSI recognised open license",closed,,No documentation available.,closed,https://github.com/Xwin-LM/Xwin-LM#news,"No information available beyond that it is based on Llama and ""RLHF plays crucial role""",closed,https://github.com/Xwin-LM/Xwin-LM#news,"No preprint available; ""Coming soon (Stay tuned)""",closed,,No peer-reviewed paper available,closed,https://huggingface.co/Xwin-LM/Xwin-LM-70B-V0.1,"HuggingFace ""model card"" used to advertise model, but no details available.",closed,,Datasheet not provided.,closed,,Package not provided,partial,,API available through vllm and HuggingFace,/projects/Xwin-LM.yaml,1.0
Expand Down
Loading

0 comments on commit e7800cf

Please sign in to comment.