Unable to visualize metadata (Error: bad label format and CalledProcessError) #1625

Closed
npatki opened this issue Oct 11, 2023 · 1 comment · Fixed by #1670

Comments

@npatki
Contributor

npatki commented Oct 11, 2023

Environment Details

  • SDV version: 1.4.0 (latest)
  • Python version: 3.10
  • Operating System: Linux (Colab Notebook)

Error Description

When I attempt to visualize the metadata for a single table, I get a CalledProcessError from the underlying graphing library (graphviz). The label passed to graphviz appears to be malformed, since the first line of the output is Error: bad label format.

Steps to reproduce

Download the Income dataset from Kaggle. Load the train.csv file, auto-detect the metadata, and attempt to visualize it.

import pandas as pd
from sdv.metadata import SingleTableMetadata

real_data = pd.read_csv('train.csv')
real_data['income_>50K'] = real_data['income_>50K'].astype(str)

metadata = SingleTableMetadata()
metadata.detect_from_dataframe(real_data)

metadata.visualize()

Output:

Error: bad label format 
...
CalledProcessError: '[PosixPath('dot'), '-Kdot', '-Tsvg']' returned non-zero exit status 1. ...

Full stack trace is attached below.
stack_trace.txt
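
A possible workaround, offered as a sketch and not a confirmed diagnosis: Graphviz record-shaped labels treat {, }, |, < and > as special characters, and the column name income_>50K contains >. Assuming that is what triggers the bad label format error here (nothing in this report confirms it), renaming the affected columns before detecting the metadata may avoid the failure. build_rename_map is a hypothetical helper, not part of SDV, and every column name in the example other than income_>50K is illustrative.

```python
import re

# Characters with special meaning inside Graphviz record-shaped node labels.
RECORD_SPECIAL_CHARS = re.compile(r'[{}|<>]')


def build_rename_map(column_names, replacement='_'):
    """Map each column name containing a special character to a safe name.

    Hypothetical helper for illustration; not part of SDV.
    """
    rename_map = {}
    for name in column_names:
        safe_name = RECORD_SPECIAL_CHARS.sub(replacement, name)
        if safe_name != name:
            rename_map[name] = safe_name
    return rename_map


# Illustrative column list; only 'income_>50K' comes from the report.
columns = ['age', 'workclass', 'income_>50K']
rename_map = build_rename_map(columns)
print(rename_map)  # {'income_>50K': 'income__50K'}

# Possible usage with the reproduction script, before detecting metadata:
# real_data = real_data.rename(columns=build_rename_map(real_data.columns))
```

Renaming changes the column names in the data as well, so check that the replacement does not collide with an existing column before relying on it.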

npatki added the bug and feature:metadata labels on Oct 11, 2023
amontanez24 added this to the 1.6.1 milestone on Nov 13, 2023
@FlyingDutchman26

I have the same problem!
