Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug Report: ydata-profiling won't work in Azure Synapse #1578

Open
3 tasks done
ramonsuarez opened this issue Apr 25, 2024 · 3 comments
Open
3 tasks done

Bug Report: ydata-profiling won't work in Azure Synapse #1578

ramonsuarez opened this issue Apr 25, 2024 · 3 comments
Labels
information requested ❔ Cannot reproduce, waiting for minimum reproduction details.

Comments

@ramonsuarez
Copy link

ramonsuarez commented Apr 25, 2024

Current Behaviour

Using your databricks notebook example with a different table and and added cell for installing ydata-profiling, when I try to install ydata towards the end (and after messages about not being able to uninstall Pandas and Seaborn) I get this error:

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
azureml-opendatasets 1.55.0 requires pandas<=2.0.0,>=0.21.0, but you have pandas 2.2.2 which is incompatible.

The kernel autostarts at the end of the install process and the the code fails because the df that is loaded at the beginning it is no longer in memory, so I have to run the first cell again.

After this I run the code that did not work (pasted into a new cell not to trigger the install again), and the next error comes when running report_html = report.to_html().

But then when it arrives to the profile_json, I do get data from the table in it.

I've installed following the instructions in your website with and without [pyspark], [notebook] and [pyspark, notebook]. The errors are the same

Pastebin with all the output.

Expected Behaviour

Install without errors and display html report in Azure Synapse notebook (pyspark)

Data Description

Dataset is one I'm practicing with that is already in my workspace. It contains metadata about the publicly available tables I've imported from a public transport company.

Code that reproduces the bug

%pip install ydata-profiling[pyspark,notebook]
from ydata_profiling import ProfileReport
import pandas as pd

report = ProfileReport(
    df,
    title="NYC yellow taxi trip",
    infer_dtypes=False,
    interactions=None,
    missing_diagrams=None,
    correlations={
        "auto": {"calculate": False},
        "pearson": {"calculate": True},
        "spearman": {"calculate": True},
    },
)
# Export the report as html and display
report_html = report.to_html()
displayHTML(report_html

pandas-profiling version

4.7.0

Dependencies

absl-py @ file:///home/conda/feedstock_root/build_artifacts/absl-py_1705494584803/work
adal @ file:///home/conda/feedstock_root/build_artifacts/adal_1617781612176/work
adlfs @ file:///home/conda/feedstock_root/build_artifacts/adlfs_1697673565385/work
aiohttp @ file:///home/conda/feedstock_root/build_artifacts/aiohttp_1707669771084/work
aiosignal @ file:///home/conda/feedstock_root/build_artifacts/aiosignal_1667935791922/work
annotated-types==0.6.0
anyio @ file:///home/conda/feedstock_root/build_artifacts/anyio_1688651106312/work/dist
applicationinsights==0.11.10
argcomplete==3.2.3
argon2-cffi @ file:///home/conda/feedstock_root/build_artifacts/argon2-cffi_1692818318753/work
argon2-cffi-bindings @ file:///home/conda/feedstock_root/build_artifacts/argon2-cffi-bindings_1695386546427/work
arrow @ file:///home/conda/feedstock_root/build_artifacts/arrow_1696128962909/work
asttokens @ file:///home/conda/feedstock_root/build_artifacts/asttokens_1698341106958/work
astunparse @ file:///home/conda/feedstock_root/build_artifacts/astunparse_1610696312422/work
async-timeout @ file:///home/conda/feedstock_root/build_artifacts/async-timeout_1691763562544/work
attrs @ file:///home/conda/feedstock_root/build_artifacts/attrs_1704011227531/work
azure-ai-ml==1.12.1
azure-common==1.1.28
azure-core @ file:///home/conda/feedstock_root/build_artifacts/azure-core_1709256515314/work
azure-datalake-store @ file:///home/conda/feedstock_root/build_artifacts/azure-datalake-store_1602801618978/work
azure-graphrbac==0.61.1
azure-identity @ file:///home/conda/feedstock_root/build_artifacts/azure-identity_1698340245585/work
azure-mgmt-authorization==4.0.0
azure-mgmt-containerregistry==10.3.0
azure-mgmt-core==1.4.0
azure-mgmt-keyvault==10.3.0
azure-mgmt-network==25.2.0
azure-mgmt-resource==23.0.1
azure-mgmt-storage==21.1.0
azure-storage-blob==12.19.0
azure-storage-file-datalake @ file:///home/conda/feedstock_root/build_artifacts/azure-storage-file-datalake_1699417359308/work
azure-storage-file-share==12.15.0
azure-synapse-ml-predict @ file:///mnt/vhdcreationfs/binaries/SynapseML-Predict/20231110.21/azure_synapse_ml_predict-1.0.0-py3-none-any.whl#sha256=5d38d0d896bffb1b2b8579ec42fba68d7edb304446179087b4120f0e292dd4e2
azureml-core==1.55.0
azureml-dataprep==5.1.6
azureml-dataprep-native==41.0.0
azureml-dataprep-rslex==2.22.2
azureml-dataset-runtime==1.55.0
azureml-mlflow==1.55.0
azureml-opendatasets==1.55.0
azureml-synapse @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/azureml_synapse-0.0.1-py3-none-any.whl#sha256=dd3e3525ff218470604ab484df8886d1cbb1e64af0b5eb7101570e0bb828f21a
azureml-telemetry==1.55.0
backcall @ file:///home/conda/feedstock_root/build_artifacts/backcall_1592338393461/work
backports.tempfile==1.0
backports.weakref==1.0.post1
bcrypt @ file:///home/conda/feedstock_root/build_artifacts/bcrypt_1702663837948/work
beautifulsoup4 @ file:///home/conda/feedstock_root/build_artifacts/beautifulsoup4_1680888073205/work
bleach @ file:///home/conda/feedstock_root/build_artifacts/bleach_1696630167146/work
blinker @ file:///home/conda/feedstock_root/build_artifacts/blinker_1698890160476/work
Brotli @ file:///home/conda/feedstock_root/build_artifacts/brotli-split_1695989787169/work
cached-property @ file:///home/conda/feedstock_root/build_artifacts/cached_property_1615209429212/work
cachetools @ file:///home/conda/feedstock_root/build_artifacts/cachetools_1708987703938/work
certifi @ file:///home/conda/feedstock_root/build_artifacts/certifi_1707022139797/work/certifi
cffi @ file:///home/conda/feedstock_root/build_artifacts/cffi_1696001684923/work
charset-normalizer @ file:///home/conda/feedstock_root/build_artifacts/charset-normalizer_1698833585322/work
click @ file:///home/conda/feedstock_root/build_artifacts/click_1692311806742/work
cloudpickle==2.2.1
clr-loader @ file:///home/conda/feedstock_root/build_artifacts/clr_loader_1691434961632/work
colorama @ file:///home/conda/feedstock_root/build_artifacts/colorama_1666700638685/work
comm @ file:///home/conda/feedstock_root/build_artifacts/comm_1710320294760/work
conda-package-handling @ file:///home/conda/feedstock_root/build_artifacts/conda-package-handling_1691048088238/work
conda_package_streaming @ file:///home/conda/feedstock_root/build_artifacts/conda-package-streaming_1691009212940/work
configparser @ file:///home/conda/feedstock_root/build_artifacts/configparser_1708794332602/work
contextlib2==21.6.0
contourpy @ file:///home/conda/feedstock_root/build_artifacts/contourpy_1699041363598/work
control-script @ file:///mnt/vhdcreationfs/binaries/AMLUtils/control_script-1.0.3-py3-none-any.whl#sha256=acca8795e7cc4fb7262fe4f728bc2f30586176d8381b8ee9a57089a01cbe351e
cryptography==41.0.7
cycler @ file:///home/conda/feedstock_root/build_artifacts/cycler_1696677705766/work
dacite==1.8.1
dash @ file:///home/conda/feedstock_root/build_artifacts/dash_1710075327199/work
dash-core-components @ file:///home/conda/feedstock_root/build_artifacts/dash-core-components_1645162191927/work
dash-cytoscape @ file:///home/conda/feedstock_root/build_artifacts/dash_cytoscape_1619800039331/work
dash-html-components @ file:///home/conda/feedstock_root/build_artifacts/dash-html-components_1645162180756/work
dash-table @ file:///home/conda/feedstock_root/build_artifacts/dash-table_1645162199872/work
databricks-cli @ file:///home/conda/feedstock_root/build_artifacts/databricks-cli_1696525400629/work
debugpy @ file:///home/conda/feedstock_root/build_artifacts/debugpy_1707444420542/work
decorator @ file:///home/conda/feedstock_root/build_artifacts/decorator_1641555617451/work
defusedxml @ file:///home/conda/feedstock_root/build_artifacts/defusedxml_1615232257335/work
dill @ file:///home/conda/feedstock_root/build_artifacts/dill_1706434688412/work
distlib @ file:///home/conda/feedstock_root/build_artifacts/distlib_1702383208639/work
docker @ file:///home/conda/feedstock_root/build_artifacts/docker-py_1702081460188/work
entrypoints @ file:///home/conda/feedstock_root/build_artifacts/entrypoints_1643888246732/work
et-xmlfile @ file:///home/conda/feedstock_root/build_artifacts/et_xmlfile_1674664118162/work
exceptiongroup @ file:///home/conda/feedstock_root/build_artifacts/exceptiongroup_1704921103267/work
executing @ file:///home/conda/feedstock_root/build_artifacts/executing_1698579936712/work
fastjsonschema @ file:///home/conda/feedstock_root/build_artifacts/python-fastjsonschema_1703780968325/work/dist
filelock @ file:///home/conda/feedstock_root/build_artifacts/filelock_1698714947081/work
Flask @ file:///home/conda/feedstock_root/build_artifacts/flask_1707043907952/work
flatbuffers @ file:///home/conda/feedstock_root/build_artifacts/python-flatbuffers_1710063001387/work
fluent-logger==0.10.0
fonttools @ file:///home/conda/feedstock_root/build_artifacts/fonttools_1708049092208/work
fqdn @ file:///home/conda/feedstock_root/build_artifacts/fqdn_1638810296540/work/dist
frozenlist @ file:///home/conda/feedstock_root/build_artifacts/frozenlist_1702645481127/work
fsspec @ file:///home/conda/feedstock_root/build_artifacts/fsspec_1707102468451/work
fsspec-wrapper==0.1.13
fusepy==3.0.1
gast @ file:///home/conda/feedstock_root/build_artifacts/gast_1688368721366/work
geographiclib @ file:///home/conda/feedstock_root/build_artifacts/geographiclib_1650904727084/work
geopy @ file:///home/conda/feedstock_root/build_artifacts/geopy_1709140178986/work
gevent @ file:///home/conda/feedstock_root/build_artifacts/gevent_1696750251337/work
gitdb @ file:///home/conda/feedstock_root/build_artifacts/gitdb_1697791558612/work
GitPython @ file:///home/conda/feedstock_root/build_artifacts/gitpython_1708069240306/work
gmpy2 @ file:///home/conda/feedstock_root/build_artifacts/gmpy2_1666808654411/work
google-api-core==2.17.1
google-auth @ file:///home/conda/feedstock_root/build_artifacts/google-auth_1709994919355/work
google-auth-oauthlib @ file:///home/conda/feedstock_root/build_artifacts/google-auth-oauthlib_1702414855226/work
google-pasta==0.2.0
googleapis-common-protos==1.63.0
greenlet @ file:///home/conda/feedstock_root/build_artifacts/greenlet_1703201576006/work
grpcio @ file:///home/conda/feedstock_root/build_artifacts/grpc-split_1700258025969/work
h5py @ file:///home/conda/feedstock_root/build_artifacts/h5py_1702471429234/work
html5lib @ file:///home/conda/feedstock_root/build_artifacts/html5lib_1592930327044/work
htmlmin==0.1.12
humanfriendly==10.0
idna @ file:///home/conda/feedstock_root/build_artifacts/idna_1701026962277/work
ImageHash==4.3.1
imageio @ file:///home/conda/feedstock_root/build_artifacts/imageio_1702571712725/work
importlib_metadata @ file:///home/conda/feedstock_root/build_artifacts/importlib-metadata_1709821103657/work
importlib_resources @ file:///home/conda/feedstock_root/build_artifacts/importlib_resources_1710342780708/work
impulse-python-handler @ file:///mnt/vhdcreationfs/binaries/Impulse/release/20240108.4/impulse_python_handler-1.0.19.1.0.0-py3-none-any.whl#sha256=6ba2906a703a337a5206d769ff817d1aae7ee62e2406ded1142e8275b6afa1eb
interpret @ file:///home/conda/feedstock_root/build_artifacts/interpret_1702548309059/work/python/interpret
interpret-core @ file:///home/conda/feedstock_root/build_artifacts/interpret-core_1702529833113/work/python/interpret-core
ipykernel @ file:///home/conda/feedstock_root/build_artifacts/ipykernel_1708996548741/work
ipython @ file:///home/conda/feedstock_root/build_artifacts/ipython_1685727741709/work
ipywidgets @ file:///home/conda/feedstock_root/build_artifacts/ipywidgets_1688489611428/work
isodate @ file:///home/conda/feedstock_root/build_artifacts/isodate_1639582763789/work
isoduration @ file:///home/conda/feedstock_root/build_artifacts/isoduration_1638811571363/work/dist
itsdangerous @ file:///home/conda/feedstock_root/build_artifacts/itsdangerous_1648147185463/work
jedi @ file:///home/conda/feedstock_root/build_artifacts/jedi_1696326070614/work
jeepney==0.8.0
Jinja2 @ file:///home/conda/feedstock_root/build_artifacts/jinja2_1704966972576/work
jmespath==1.0.1
joblib @ file:///home/conda/feedstock_root/build_artifacts/joblib_1691577114857/work
jsonpickle==3.0.3
jsonpointer @ file:///home/conda/feedstock_root/build_artifacts/jsonpointer_1695397238043/work
jsonschema @ file:///home/conda/feedstock_root/build_artifacts/jsonschema-meta_1705707496704/work
jsonschema-specifications @ file:///tmp/tmpkv1z7p57/src
jupyter-events @ file:///home/conda/feedstock_root/build_artifacts/jupyter_events_1710262592257/work
jupyter-ui-poll==0.2.2
jupyter_client @ file:///home/conda/feedstock_root/build_artifacts/jupyter_client_1710255804825/work
jupyter_core @ file:///home/conda/feedstock_root/build_artifacts/jupyter_core_1710257277185/work
jupyter_server @ file:///home/conda/feedstock_root/build_artifacts/jupyter_server_1687869799272/work
jupyter_server_terminals @ file:///home/conda/feedstock_root/build_artifacts/jupyter_server_terminals_1710262634903/work
jupyterlab_pygments @ file:///home/conda/feedstock_root/build_artifacts/jupyterlab_pygments_1707149102966/work
jupyterlab_widgets @ file:///home/conda/feedstock_root/build_artifacts/jupyterlab_widgets_1707421892171/work
keras @ file:///home/conda/feedstock_root/build_artifacts/keras_1700038168849/work/keras-2.15.0-py3-none-any.whl#sha256=2dcc6d2e30cf9c951064b63c1f4c404b966c59caf09e01f3549138ec8ee0dd1f
kiwisolver @ file:///home/conda/feedstock_root/build_artifacts/kiwisolver_1695379902431/work
knack==0.11.0
liac-arff @ file:///home/conda/feedstock_root/build_artifacts/liac-arff_1612178018045/work
library-metadata-cooker @ file:///mnt/vhdcreationfs/binaries/library-metadata-cooker/20230803.1/library_metadata_cooker-0.0.7-py3-none-any.whl#sha256=a1be26c7df7d247a054b44e88c9d6a23d2f893782df59df3e0d62469e10a9e07
lightgbm @ file:///home/conda/feedstock_root/build_artifacts/lightgbm_1703249212667/work
llvmlite==0.42.0
lxml @ file:///home/conda/feedstock_root/build_artifacts/lxml_1704724217654/work
Markdown @ file:///home/conda/feedstock_root/build_artifacts/markdown_1698797478597/work
MarkupSafe @ file:///home/conda/feedstock_root/build_artifacts/markupsafe_1706899921127/work
marshmallow==3.21.1
matplotlib @ file:///home/conda/feedstock_root/build_artifacts/matplotlib-suite_1700509477637/work
matplotlib-inline @ file:///home/conda/feedstock_root/build_artifacts/matplotlib-inline_1660814786464/work
mistune @ file:///home/conda/feedstock_root/build_artifacts/mistune_1698947099619/work
mkl-service==2.4.1
mkl_fft==1.3.8
mkl_random @ file:///home/conda/feedstock_root/build_artifacts/mkl_random_1707959967054/work
ml-dtypes @ file:///home/conda/feedstock_root/build_artifacts/ml_dtypes_1695280938812/work
mlflow-skinny @ file:///home/conda/feedstock_root/build_artifacts/mlflow-split_1702568771627/work
mltable==1.6.1
mpmath @ file:///home/conda/feedstock_root/build_artifacts/mpmath_1678228039184/work
msal @ file:///home/conda/feedstock_root/build_artifacts/msal_1708642030588/work
msal-extensions @ file:///home/conda/feedstock_root/build_artifacts/msal_extensions_1706838290559/work
msgpack==1.0.8
msrest @ file:///home/conda/feedstock_root/build_artifacts/msrest_1662028346497/work
msrestazure==0.6.4
multidict @ file:///home/conda/feedstock_root/build_artifacts/multidict_1707040698785/work
multimethod==1.11.2
multiprocess @ file:///home/conda/feedstock_root/build_artifacts/multiprocess_1706514640841/work
munkres==1.1.4
mypy==1.4.1
mypy-extensions==1.0.0
nbclient @ file:///home/conda/feedstock_root/build_artifacts/nbclient_1710317608672/work
nbconvert @ file:///home/conda/feedstock_root/build_artifacts/nbconvert-meta_1709581262419/work
nbformat @ file:///home/conda/feedstock_root/build_artifacts/nbformat_1710262332870/work
ndg-httpsclient==0.5.1
nest_asyncio @ file:///home/conda/feedstock_root/build_artifacts/nest-asyncio_1705850609492/work
networkx @ file:///home/conda/feedstock_root/build_artifacts/networkx_1698504735452/work
notebookutils @ file:///mnt/vhdcreationfs/binaries/NotebookUtils/20240309.2/notebookutils-3.4.1_20240309.2-py3-none-any.whl#sha256=6d8e85e4a49864fcaa4312f31015580453520d6a5e2d809cb87fe9c3ebc07172
numba @ file:///home/conda/feedstock_root/build_artifacts/numba_1707024805237/work
numpy==1.23.5
oauthlib @ file:///home/conda/feedstock_root/build_artifacts/oauthlib_1666056362788/work
onnx==1.15.0
opencensus==0.11.4
opencensus-context==0.1.3
opencensus-ext-azure==1.1.13
openpyxl @ file:///home/conda/feedstock_root/build_artifacts/openpyxl_1695464693876/work
opt-einsum @ file:///home/conda/feedstock_root/build_artifacts/opt_einsum_1696448916724/work
overrides @ file:///home/conda/feedstock_root/build_artifacts/overrides_1706394519472/work
packaging @ file:///home/conda/feedstock_root/build_artifacts/packaging_1696202382185/work
pandas==2.2.2
pandasql @ file:///home/conda/feedstock_root/build_artifacts/pandasql_1623699966879/work
pandocfilters @ file:///home/conda/feedstock_root/build_artifacts/pandocfilters_1631603243851/work
paramiko @ file:///home/conda/feedstock_root/build_artifacts/paramiko_1703015906107/work
parso @ file:///home/conda/feedstock_root/build_artifacts/parso_1638334955874/work
pathos @ file:///home/conda/feedstock_root/build_artifacts/pathos_1706533117008/work
pathspec==0.12.1
patsy @ file:///home/conda/feedstock_root/build_artifacts/patsy_1704469236901/work
pexpect @ file:///home/conda/feedstock_root/build_artifacts/pexpect_1706113125309/work
phik==0.12.4
pickleshare @ file:///home/conda/feedstock_root/build_artifacts/pickleshare_1602536217715/work
pillow @ file:///home/conda/feedstock_root/build_artifacts/pillow_1704252020178/work
pkginfo==1.10.0
pkgutil_resolve_name @ file:///home/conda/feedstock_root/build_artifacts/pkgutil-resolve-name_1694617248815/work
platformdirs @ file:///home/conda/feedstock_root/build_artifacts/platformdirs_1696272223550/work
plotly @ file:///home/conda/feedstock_root/build_artifacts/plotly_1698272730927/work
ply==3.11
portalocker @ file:///home/conda/feedstock_root/build_artifacts/portalocker_1695662047585/work
powerbiclient==3.1.1
pox @ file:///home/conda/feedstock_root/build_artifacts/pox_1706431181924/work
ppft @ file:///home/conda/feedstock_root/build_artifacts/ppft_1706409481851/work
prettytable @ file:///home/conda/feedstock_root/build_artifacts/prettytable_1694464263010/work
prometheus_client @ file:///home/conda/feedstock_root/build_artifacts/prometheus_client_1707932675456/work
prompt-toolkit @ file:///home/conda/feedstock_root/build_artifacts/prompt-toolkit_1702399386289/work
protobuf==4.24.4
psutil @ file:///home/conda/feedstock_root/build_artifacts/psutil_1705722392846/work
ptyprocess @ file:///home/conda/feedstock_root/build_artifacts/ptyprocess_1609419310487/work/dist/ptyprocess-0.7.0-py2.py3-none-any.whl
pure-eval @ file:///home/conda/feedstock_root/build_artifacts/pure_eval_1642875951954/work
py4j==0.10.9.7
pyarrow==14.0.2
pyasn1 @ file:///home/conda/feedstock_root/build_artifacts/pyasn1_1701287008248/work
pyasn1-modules @ file:///home/conda/feedstock_root/build_artifacts/pyasn1-modules_1695107857548/work
pycairo==1.26.0
pycosat @ file:///home/conda/feedstock_root/build_artifacts/pycosat_1696355758174/work
pycparser @ file:///home/conda/feedstock_root/build_artifacts/pycparser_1636257122734/work
pydantic==2.7.1
pydantic_core==2.18.2
pydash==7.0.5
Pygments @ file:///home/conda/feedstock_root/build_artifacts/pygments_1700607939962/work
PyGObject==3.48.1
PyJWT @ file:///home/conda/feedstock_root/build_artifacts/pyjwt_1706895065046/work
PyNaCl @ file:///home/conda/feedstock_root/build_artifacts/pynacl_1695544850803/work
pyodbc @ file:///home/conda/feedstock_root/build_artifacts/pyodbc_1697223139819/work
pyOpenSSL==23.3.0
pyparsing @ file:///home/conda/feedstock_root/build_artifacts/pyparsing_1709721012883/work
pyperclip @ file:///home/conda/feedstock_root/build_artifacts/pyperclip_1622337600177/work
PyQt5==5.15.9
PyQt5-sip==12.12.2
PySocks @ file:///home/conda/feedstock_root/build_artifacts/pysocks_1661604839144/work
pyspark==3.4.1.5.3.20230713
python-dateutil @ file:///home/conda/feedstock_root/build_artifacts/python-dateutil_1709299778482/work
python-json-logger @ file:///home/conda/feedstock_root/build_artifacts/python-json-logger_1677079630776/work
pythonnet @ file:///home/conda/feedstock_root/build_artifacts/pythonnet_1697010695704/work
pytz @ file:///home/conda/feedstock_root/build_artifacts/pytz_1706549378554/work
pyu2f @ file:///home/conda/feedstock_root/build_artifacts/pyu2f_1604248910016/work
PyWavelets==1.6.0
PyYAML @ file:///home/conda/feedstock_root/build_artifacts/pyyaml_1695373428874/work
pyzmq @ file:///home/conda/feedstock_root/build_artifacts/pyzmq_1701783169502/work
referencing @ file:///home/conda/feedstock_root/build_artifacts/referencing_1706711412823/work
regex @ file:///home/conda/feedstock_root/build_artifacts/regex_1703393490683/work
requests @ file:///home/conda/feedstock_root/build_artifacts/requests_1684774241324/work
requests-oauthlib @ file:///home/conda/feedstock_root/build_artifacts/requests-oauthlib_1710149579371/work
retrying @ file:///Users/ktietz/demo/mc3/conda-bld/retrying_1629465456590/work
rfc3339-validator @ file:///home/conda/feedstock_root/build_artifacts/rfc3339-validator_1638811747357/work
rfc3986-validator @ file:///home/conda/feedstock_root/build_artifacts/rfc3986-validator_1598024191506/work
rpds-py @ file:///home/conda/feedstock_root/build_artifacts/rpds-py_1707922703488/work
rsa @ file:///home/conda/feedstock_root/build_artifacts/rsa_1658328885051/work
ruamel-yaml-conda @ file:///home/conda/feedstock_root/build_artifacts/ruamel_yaml_1695546328261/work
ruamel.yaml @ file:///home/conda/feedstock_root/build_artifacts/ruamel.yaml_1699007337104/work
ruamel.yaml.clib @ file:///home/conda/feedstock_root/build_artifacts/ruamel.yaml.clib_1695996839082/work
SALib @ file:///home/conda/feedstock_root/build_artifacts/salib_1708298432030/work
scikit-learn @ file:///home/conda/feedstock_root/build_artifacts/scikit-learn_1701931992027/work
scipy @ file:///croot/scipy_1701295040508/work/dist/scipy-1.11.4-cp310-cp310-linux_x86_64.whl#sha256=a5e5a33a143f627fd4bde622162664c9ea97ff3af56f51c0f4c76ec9d1a340a3
seaborn==0.12.2
SecretStorage==3.3.3
Send2Trash @ file:///home/conda/feedstock_root/build_artifacts/send2trash_1682601222253/work
shap @ file:///home/conda/feedstock_root/build_artifacts/shap_1704716387123/work
sip @ file:///home/conda/feedstock_root/build_artifacts/sip_1697300428978/work
six @ file:///home/conda/feedstock_root/build_artifacts/six_1620240208055/work
slicer @ file:///home/conda/feedstock_root/build_artifacts/slicer_1608146800664/work
smmap @ file:///home/conda/feedstock_root/build_artifacts/smmap_1634310307496/work
sniffio @ file:///home/conda/feedstock_root/build_artifacts/sniffio_1708952932303/work
soupsieve @ file:///home/conda/feedstock_root/build_artifacts/soupsieve_1693929250441/work
SQLAlchemy @ file:///home/conda/feedstock_root/build_artifacts/sqlalchemy_1709646228484/work
sqlanalyticsconnectorpy @ file:///mnt/vhdcreationfs/binaries/SQLAnalyticsConnector/Python/release-3.3/sqlanalyticsconnectorpy-1.0.1-py3-none-any.whl#sha256=90bdf9e7f6f9d80dbe7f75cebc326bca133d455c55e6fdb869697c883dff071b
sqlparse @ file:///home/conda/feedstock_root/build_artifacts/sqlparse_1681817562700/work
stack-data @ file:///home/conda/feedstock_root/build_artifacts/stack_data_1669632077133/work
statsmodels @ file:///home/conda/feedstock_root/build_artifacts/statsmodels_1702575356319/work
strictyaml==1.7.3
sympy @ file:///home/conda/feedstock_root/build_artifacts/sympy_1684180540116/work
synapseml-cognitive @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_cognitive-1.0.2-py2.py3-none-any.whl#sha256=2f6bf94c24b344f6baaf6a3df7e41f134db1a5ae34a02d41d31f4d4f8b62984f
synapseml-core @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_core-1.0.2-py2.py3-none-any.whl#sha256=36bfe0ffd517ecd6c6540d7ec8afc8681f38a443f157d498b36eb5c80b11a39d
synapseml-deep-learning @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_deep_learning-1.0.2-py2.py3-none-any.whl#sha256=4fb7fee1282f2c1730523d126a28584704e7636ba9c70a9d8ba188f2ea350100
synapseml-internal @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_internal-1.0.2.1.dev1-py2.py3-none-any.whl#sha256=b829ac4af7abc5213d3048d9fdd0c0de426f27d78a729bd0701f56c76bded06c
synapseml-lightgbm @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_lightgbm-1.0.2-py2.py3-none-any.whl#sha256=4a55e4b791db4a69797932cbb2653f162f4ca9788164b6c62c61d64b5b159332
synapseml-opencv @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_opencv-1.0.2-py2.py3-none-any.whl#sha256=ec0426924cb4419b1c9782ac0d78a84a43357a290e6e0fdd6d73a5f2c2b1080f
synapseml-vw @ file:///mnt/vhdcreationfs/binaries/SynapseML-MMLSpark/20231222.1/synapseml/python/synapseml_vw-1.0.2-py2.py3-none-any.whl#sha256=ff9ed632fd804a80a13f8bb4d6262224b43d90708f1b2828be99f4b433e9ffda
tabulate @ file:///home/conda/feedstock_root/build_artifacts/tabulate_1665138452165/work
tenacity @ file:///home/conda/feedstock_root/build_artifacts/tenacity_1692026804430/work
tensorboard @ file:///home/conda/feedstock_root/build_artifacts/tensorboard_1707486779203/work/tensorboard-2.15.2-py3-none-any.whl#sha256=a6f6443728064d962caea6d34653e220e34ef8df764cb06a8212c17e1a8f0622
tensorboard-data-server @ file:///home/conda/feedstock_root/build_artifacts/tensorboard-data-server_1695425366946/work/tensorboard_data_server-0.7.0-py3-none-manylinux2014_x86_64.whl#sha256=aa1f69b2111bb4309cc6277ac277c89a9f67d074aa666b96eebe7401a359e1d5
tensorflow @ file:///home/conda/feedstock_root/build_artifacts/tensorflow-split_1705375591437/work/tensorflow_pkg/tensorflow-2.15.0-cp310-cp310-linux_x86_64.whl#sha256=8b64914c5b086624aeb1244054fc8e437dfa107e22109a7609d0996d61a6dbc8
tensorflow_estimator @ file:///home/conda/feedstock_root/build_artifacts/tensorflow-split_1705375591437/work/tensorflow-estimator/wheel_dir/tensorflow_estimator-2.15.0-py2.py3-none-any.whl#sha256=b0345315b95e8a8d5da95d3d481e9adcffe5e967377d0e4834092a49a7d950f8
termcolor @ file:///home/conda/feedstock_root/build_artifacts/termcolor_1704357939450/work
terminado @ file:///home/conda/feedstock_root/build_artifacts/terminado_1710262609923/work
threadpoolctl @ file:///home/conda/feedstock_root/build_artifacts/threadpoolctl_1707930541534/work
tinycss2 @ file:///home/conda/feedstock_root/build_artifacts/tinycss2_1666100256010/work
toml @ file:///home/conda/feedstock_root/build_artifacts/toml_1604308577558/work
tomli @ file:///home/conda/feedstock_root/build_artifacts/tomli_1644342247877/work
toolz @ file:///home/conda/feedstock_root/build_artifacts/toolz_1706112571092/work
torch==2.0.1
tornado @ file:///home/conda/feedstock_root/build_artifacts/tornado_1708363098266/work
tqdm @ file:///home/conda/feedstock_root/build_artifacts/tqdm_1707598593068/work
traitlets @ file:///home/conda/feedstock_root/build_artifacts/traitlets_1710254411456/work
typed-ast @ file:///home/conda/feedstock_root/build_artifacts/typed-ast_1695409894288/work
typeguard==4.2.1
types-python-dateutil @ file:///home/conda/feedstock_root/build_artifacts/types-python-dateutil_1710134223430/work
typing-utils @ file:///home/conda/feedstock_root/build_artifacts/typing_utils_1622899189314/work
typing_extensions @ file:///home/conda/feedstock_root/build_artifacts/typing_extensions_1708904622550/work
tzdata==2024.1
unicodedata2 @ file:///home/conda/feedstock_root/build_artifacts/unicodedata2_1695847980273/work
uri-template @ file:///home/conda/feedstock_root/build_artifacts/uri-template_1688655812972/work/dist
urllib3 @ file:///home/conda/feedstock_root/build_artifacts/urllib3_1699933488691/work
virtualenv @ file:///home/conda/feedstock_root/build_artifacts/virtualenv_1687005325630/work
visions==0.7.6
wcwidth @ file:///home/conda/feedstock_root/build_artifacts/wcwidth_1704731205417/work
webcolors @ file:///home/conda/feedstock_root/build_artifacts/webcolors_1679900785843/work
webencodings @ file:///home/conda/feedstock_root/build_artifacts/webencodings_1694681268211/work
websocket-client @ file:///home/conda/feedstock_root/build_artifacts/websocket-client_1701630677416/work
Werkzeug @ file:///home/conda/feedstock_root/build_artifacts/werkzeug_1698235201373/work
widgetsnbextension @ file:///home/conda/feedstock_root/build_artifacts/widgetsnbextension_1707420319466/work
wordcloud==1.9.3
wrapt @ file:///home/conda/feedstock_root/build_artifacts/wrapt_1666806031361/work
xgboost @ file:///home/conda/feedstock_root/build_artifacts/xgboost-split_1705649946806/work/python-package
yarl @ file:///home/conda/feedstock_root/build_artifacts/yarl_1705508292061/work
ydata-profiling==4.7.0
zipp @ file:///home/conda/feedstock_root/build_artifacts/zipp_1695255097490/work
zope.event @ file:///home/conda/feedstock_root/build_artifacts/zope.event_1687705558811/work
zope.interface @ file:///home/conda/feedstock_root/build_artifacts/zope.interface_1708077566292/work
zstandard==0.22.0
Note: you may need to restart the kernel to use updated packages.

OS

spark on Azure Synapse

Checklist

  • There is not yet another bug report for this issue in the issue tracker
  • The problem is reproducible from this bug report. This guide can help to craft a minimal bug report.
  • The issue has not been resolved by the entries listed under Common Issues.
@fabclmnt
Copy link
Contributor

Hi @ramonsuarez ,

thank you for your request, you have to make sure that you install the version of YData-profiling that is compatible with Synapses pre-installed packages. We are not responsible for synapses environment and packages that they are currently using.

As an example, synapse's environment is using an older pandas version, as per the message you are getting azureml-opendatasets 1.55.0 requires pandas<=2.0.0. I suggest that you install an ydata-profiling version that enables you to have a pandas version that is under 2.0.0 using a ydata-profiling under version v4.6.2, as it is the last version we have released with pandas v1.

You can check our release history here: https://github.com/ydataai/ydata-profiling/releases

Let me know if this was helpful.

@fabclmnt fabclmnt added information requested ❔ Cannot reproduce, waiting for minimum reproduction details. and removed needs-triage labels Apr 25, 2024
@ramonsuarez
Copy link
Author

ramonsuarez commented Apr 30, 2024

Thanks a lot @fabclmnt . These notes are for your information, I hope they are useful.

I've tested both in Synapse and Fabric using a different Yellow Taxi db (huge) because I couldn't load the one in your databricks blogpost:

  • Synapse; I went back to versions 4.0.0 and 4.2.0 and couldn't get it to work with the first adapted example nor with the Yellow Taxi example code from your databricks blogpost. There was an issue with 'numba' not having 'generated_jit'. So I updated to numba==0.58.1, and the report run but redering it as html ended up in a division by zero error, that I've seen in your issues queue has been fixed in later versions of ydata-profiling.

  • Fabric: it run! I ended up with a very large yellow taxi dataset and it took over 4h . During the installation it produced this error with v4.0.0:

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. sentence-transformers 2.0.0 requires sentencepiece, which is not installed. sentence-transformers 2.0.0 requires torchvision, which is not installed. dash 2.14.0 requires Flask<2.3.0,>=1.0.4, but you have flask 3.0.0 which is incompatible. dash 2.14.0 requires Werkzeug<2.3.0, but you have werkzeug 3.0.1 which is incompatible. nni 2.10.1 requires typeguard<3, but you have typeguard 4.2.1 which is incompatible. tensorflow 2.12.1 requires typing-extensions<4.6.0,>=3.6.6, but you have typing-extensions 4.11.0 which is incompatible.

@fabclmnt
Copy link
Contributor

fabclmnt commented May 6, 2024

Hi  @ramonsuarez ,

thank you for your inputs. Regarding the error, indeed we have it fixed with later versions, and unfortunately we can't ensure or control the versions that other platforms are using in terms of pandas, numpy and other core libraries.

Based on Fabric error message, they are using older versions for packages such as typeguard and typing-extensions due to the use of an older version of Tensorflow. I would suggest to report to Azure in order to have the packages on their side upgraded.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
information requested ❔ Cannot reproduce, waiting for minimum reproduction details.
Projects
None yet
Development

No branches or pull requests

3 participants