
Add Amazon Titan #2165

Merged 3 commits on Feb 17, 2024

Conversation

@yifanmai (Collaborator) commented on Dec 19, 2023

Also adds a BedrockClient, which can be specialized to run other models on Amazon Bedrock.
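As a rough sketch of that specialization pattern (class and method names here are assumptions based on the PR description, not the actual HELM code), a base client could own the shared Bedrock plumbing while subclasses supply model-specific request conversion:

```python
from abc import ABC, abstractmethod
from typing import Any, Dict


class BedrockClient(ABC):
    """Shared plumbing for models hosted on Amazon Bedrock.

    Hypothetical sketch: the base class would handle session setup,
    invocation, and caching; subclasses map generic requests onto each
    model family's request body.
    """

    @abstractmethod
    def convert_request_to_raw_request(self, request: Dict[str, Any]) -> Dict[str, Any]:
        """Map a generic request onto the model-specific Bedrock request body."""


class BedrockTitanClient(BedrockClient):
    def convert_request_to_raw_request(self, request: Dict[str, Any]) -> Dict[str, Any]:
        # Titan's InvokeModel body uses camelCase keys.
        return {
            "inputText": request["prompt"],
            "textGenerationConfig": {"maxTokenCount": request["max_tokens"]},
        }
```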


if assumed_role:
    sts = session.client("sts")
    response = sts.assume_role(RoleArn=str(assumed_role), RoleSessionName="langchain-llm-1")
Contributor:

Why is this called langchain?

Collaborator Author (@yifanmai):

The notebook that this code was sourced from (see the code comment for the URL) also had some LangChain tutorials, so I assume they just called it langchain because of that.

From the assume_role docs, it looks like this is just an arbitrary user-defined tag, so I'll set it to "crfm-helm" instead.

@JosselinSomervilleRoberts (Contributor):

@yifanmai, do we want to merge this?

@yifanmai (Collaborator Author) commented on Feb 7, 2024

@JosselinSomervilleRoberts this is ready for review, could you take a look?

# - https://aws.amazon.com/about-aws/whats-new/2023/11/amazon-titan-models-express-lite-bedrock/
- name: amazon/titan-text-lite-v1
display_name: Amazon Titan Text Lite
description: Amazon Titan Text Lite is a lightweight, efficient model perfect for fine-tuning English-language tasks like summarization and copywriting. It caters to customers seeking a smaller, cost-effective, and highly customizable model. It supports various formats, including text generation, code generation, rich text formatting, and orchestration (agents). Key model attributes encompass fine-tuning, text generation, code generation, and rich text formatting.
Contributor:

These descriptions seem a bit subjective. I thought we were making descriptions about scientific content, not so much use cases.

Collaborator Author (@yifanmai):

These are copied and pasted from the linked blog post. I wasn't able to find a model card or paper with a more suitable description.

Contributor:

Shouldn't we have the optional args provided here? Like bedrock_model_id?

raw_request = self.convert_request_to_raw_request(request)

# modelId isn't part of raw_request, so it must be explicitly passed into the input to the cache key
raw_request_for_cache: Dict = {"modelId": model_id, **deepcopy(raw_request)}
Contributor:

We usually don't use camel case for cache keys

Collaborator Author (@yifanmai):

Using camel case will avoid having to do extra post-processing before sending to the server. Generally, we want the cache key to be as close to the actual parameters that we sent to the API as possible. See AI21Client for another example of a camel case request.
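A minimal sketch of this cache-key construction (the helper name `build_cache_key` is hypothetical; the dict shape follows the snippet above):

```python
from copy import deepcopy
from typing import Any, Dict


def build_cache_key(model_id: str, raw_request: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: modelId is passed to Bedrock separately from the
    request body, so it is merged in explicitly to make the cache key unique
    per model. Keys stay camelCase so the cache key matches the API
    parameters verbatim, with no extra post-processing before sending."""
    return {"modelId": model_id, **deepcopy(raw_request)}
```

The `deepcopy` keeps later mutations of the outgoing request from silently changing the cached key.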

Comment on lines 89 to 109
"inputText": request.prompt,
"textGenerationConfig": {
"maxTokenCount": request.max_tokens,
# Sending a non-empty stopSequences results in an error:
# https://github.com/boto/boto3/issues/3993
"stopSequences": request.stop_sequences or [],
"temperature": request.temperature,
"topP": request.top_p,
Contributor:

same here

Collaborator Author (@yifanmai):

As above.
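Putting the diff fragment above into context, a self-contained sketch of the conversion might look like the following (the `Request` dataclass is a hypothetical stand-in for HELM's actual `Request` type):

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Request:
    """Hypothetical stand-in for HELM's Request object."""
    prompt: str
    max_tokens: int = 100
    stop_sequences: List[str] = field(default_factory=list)
    temperature: float = 1.0
    top_p: float = 1.0


def convert_request_to_raw_request(request: Request) -> Dict[str, Any]:
    # Titan's textGenerationConfig uses camelCase keys, mirroring the
    # Bedrock InvokeModel API body rather than HELM's snake_case fields.
    return {
        "inputText": request.prompt,
        "textGenerationConfig": {
            "maxTokenCount": request.max_tokens,
            "stopSequences": request.stop_sequences or [],
            "temperature": request.temperature,
            "topP": request.top_p,
        },
    }
```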

@yifanmai yifanmai force-pushed the yifanmai/fix-add-titan branch from 1cdb05c to 63b5dce Compare February 17, 2024 00:43
@yifanmai yifanmai merged commit aa3e20b into main Feb 17, 2024
6 checks passed
@yifanmai yifanmai deleted the yifanmai/fix-add-titan branch February 17, 2024 00:59