Handling OpenAI 429's gracefully #4153
Activity:
- vga91 added a commit that referenced this issue (Dec 10, 2024)
- vga91 added a commit that referenced this issue (Dec 10, 2024)
- RobertoSannino pushed a commit that referenced this issue (Dec 11, 2024)
- vga91 added a commit that referenced this issue (Dec 11, 2024)
- github-project-automation bot moved this from "In Progress" to "Done (check if cherry-pick)" in APOC Extended Larus (Dec 11, 2024)
Expected Behavior (Mandatory)
Ability to control the OpenAI backoff strategy for a large volume of embedding calls. This is standard practice in almost any client library I've used, because we cannot assume infinite capacity from our API providers.
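For illustration, here is a minimal sketch of the kind of backoff loop the issue is asking for, not APOC's actual implementation. The method name, `maxRetries`, and `baseDelayMs` are hypothetical; the point is that these knobs should be exposed to the caller rather than fixed by the procedure.

```java
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.concurrent.ThreadLocalRandom;

public class BackoffExample {

    // Retry an OpenAI-style HTTP call on 429 responses with exponential
    // backoff plus jitter, honouring a Retry-After header when present.
    static HttpResponse<String> sendWithBackoff(HttpClient client, HttpRequest request,
                                                int maxRetries, long baseDelayMs) throws Exception {
        for (int attempt = 0; ; attempt++) {
            HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
            if (response.statusCode() != 429 || attempt >= maxRetries) {
                return response; // success, a non-rate-limit error, or retries exhausted
            }
            // Prefer the server's Retry-After hint (assumed to be in seconds);
            // otherwise back off exponentially, capped at one minute.
            long delayMs = response.headers().firstValue("Retry-After")
                    .map(s -> Long.parseLong(s.trim()) * 1000)
                    .orElse(Math.min(baseDelayMs << attempt, 60_000L));
            // Full jitter spreads retries out so parallel batches don't retry in lockstep.
            Thread.sleep(ThreadLocalRandom.current().nextLong(delayMs + 1));
        }
    }
}
```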
Actual Behavior (Mandatory)
When the OpenAI API returns a 429 (rate limit) response, the call simply fails; there is no retry or backoff mechanism, and no way to configure one.
How to Reproduce the Problem
Try embedding 5M nodes at 2,000 nodes batched per API request (to maximise throughput); you end up hitting 429 errors for too many tokens per minute.
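For a sense of scale: 5M nodes at 2,000 per request is 2,500 requests, and assuming even a few hundred tokens per node (an illustrative figure, not from the report), a single batch already carries several hundred thousand tokens, so a handful of back-to-back batches exhausts a typical tokens-per-minute quota almost immediately.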
Specifications (Mandatory)
Currently used versions