Fixed bug where nested json inside pandas wouldn't be ingested correctly #568

Open
wants to merge 5 commits into
base: master
from

AsafMah (Collaborator) commented Feb 18, 2025

solves #567

github-actions bot commented Feb 18, 2025

Test Results

    6 files  ±0      6 suites  ±0   25m 43s ⏱️ -29s
  314 tests ±0    279 ✅ ±0   35 💤 ±0  0 ❌ ±0 
1 884 runs  ±0  1 674 ✅ ±0  210 💤 ±0  0 ❌ ±0 

Results for commit 5d04968. ± Comparison against base commit c2e31d3.

♻️ This comment has been updated with latest results.

@@ -117,12 +115,11 @@ def ingest_from_dataframe(self, df: "pandas.DataFrame", ingestion_properties: In
         if not isinstance(df, DataFrame):
             raise ValueError("Expected DataFrame instance, found {}".format(type(df)))

-        file_name = "df_{id}_{timestamp}_{uid}.csv.gz".format(id=id(df), timestamp=int(time.time()), uid=uuid.uuid4())
+        file_name = "df_{id}_{timestamp}_{uid}.json.gz".format(id=id(df), timestamp=int(time.time()), uid=uuid.uuid4())
Collaborator

This is a breaking change: it won't work the same way for users who have configured a CSV mapping.

Collaborator Author

That's a good point, but previously we had:

        ingestion_properties.format = DataFormat.CSV

So the format was already being forced, and a user who supplied a mapping for any format other than CSV would have hit the same problem.

I can roll it into the next breaking version (with a note in the release notes).
Or I could check for a mapping and raise an error? Or raise an error if the format doesn't match?
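
To make the "raise an error if the format doesn't match" option concrete, here is a minimal sketch. The `DataFormat` and `IngestionProps` classes are hypothetical stand-ins defined locally for illustration, not the SDK's real types, and `check_dataframe_format` is an invented helper name. It also assumes an unset format can be represented as `None`, which may not match the real properties class.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class DataFormat(Enum):
    # Hypothetical stand-in for the SDK's DataFormat enum.
    CSV = "csv"
    JSON = "json"


@dataclass
class IngestionProps:
    # Hypothetical stand-in for IngestionProperties; only the field
    # relevant to this discussion is modeled.
    format: Optional[DataFormat] = None


def check_dataframe_format(props: IngestionProps,
                           required: DataFormat = DataFormat.JSON) -> None:
    """Fail loudly on a conflicting format instead of silently overriding it."""
    if props.format is not None and props.format is not required:
        raise ValueError(
            "DataFrame ingestion serializes rows as {}, but the ingestion "
            "properties request {}".format(required.name, props.format.name)
        )
    # No explicit choice, or a matching one: use the required format.
    props.format = required
```

With this shape, a caller who explicitly asked for CSV gets a `ValueError` up front rather than a mapping that quietly stops applying, while callers who never set a format are unaffected.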

I don't think we can avoid converting it to JSON.
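
For context on why CSV is hard to keep here, the following standard-library sketch shows what happens to a nested value when it is written into a CSV cell versus a JSON line. It assumes the old CSV path stringified nested values the way Python's `csv` module does, which is not confirmed by the diff above. The sample row and helper names are illustrative only, and this is not the PR's actual implementation.

```python
import csv
import gzip
import io
import json

# Illustrative row with a nested value, similar to a DataFrame cell holding a dict.
rows = [{"id": 1, "payload": {"tags": ["a", "b"], "meta": {"ok": True}}}]


def to_csv_text(records):
    """Write records as CSV; non-string cells are converted with str()."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["id", "payload"])
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


def to_json_lines_gz(records):
    """Write one JSON document per line and gzip the result."""
    text = "\n".join(json.dumps(r) for r in records) + "\n"
    return gzip.compress(text.encode("utf-8"))


def is_valid_json(text):
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


# The CSV cell holds the Python repr of the dict (single quotes, True),
# which is not valid JSON, so the nested structure is lost downstream.
csv_cell = next(csv.DictReader(io.StringIO(to_csv_text(rows))))["payload"]
csv_cell_is_json = is_valid_json(csv_cell)

# The JSON-lines payload round-trips with the nesting intact.
blob = to_json_lines_gz(rows)
restored = json.loads(gzip.decompress(blob).decode("utf-8").splitlines()[0])
```

In pandas itself, the comparable JSON-lines output is available through `DataFrame.to_json(orient="records", lines=True)`.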
