Skip to content
@ydataai

YData

Accelerating AI with improved data

banner_ydata

YData.ai Medium LinkedIn Twitter Youtube Data-Centric AI Discord YData Profiling YData Synthetic YData Academy

Welcome to YData

Our mission is to help data science teams access and understand their data assets, and produce quality data to sucessfully deploy machine learning models.

We're the creators of YData Fabric, the first data-centric platform for data quality. We're also strong advocates of open source software and we're actively developing ydata-profiling, ydata-synthetic, and ydata-quality, three open source projects focused on producing high-quality data for machine learning applications.

You can stay up to date with the latest developments on our News or follow our Medium blog for hands-on tutorials on our open source packages.

We have a growing community of data scientists on our Discord Server, where we discuss emergent topics on Data Profiling, Data Labeling, and Synthetic Data. Join us to share feedback and discuss feature requests!

You can also find all about our montly events and data initiatives on our newsletter or reach us at [email protected].

footer_ydata

Pinned Loading

  1. ydata-profiling ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Python 12.6k 1.7k

  2. ydata-sdk ydata-sdk Public

    Public SDK to interact with the platform, either public or private

    Python 17 5

  3. ydata-synthetic ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Jupyter Notebook 1.5k 240

  4. academy academy Public

    Tutorials for YData's Fabric platform

    Jupyter Notebook 32 7

  5. ydata-talkdatatome ydata-talkdatatome Public

    Make your dataset talk to you. The AI assistant for data preparation.

    Python 9 1

  6. sd-metrics sd-metrics Public

    A repository that collects different metrics evaluate the quality of synthetic data under the scope data democratization. The metrics evaluate the quality of the synthetic data under the following …

    2

Repositories

Showing 10 of 71 repositories
  • aws-asg-tags-lambda Public

    A lambda that extracts the auto scaling groups from the k8s node pools provided by the user and adds the specified tags to those nodes

    ydataai/aws-asg-tags-lambda’s past year of commit activity
    Swift 5 MIT 0 1 6 Updated Dec 18, 2024
  • academy Public

    Tutorials for YData's Fabric platform

    ydataai/academy’s past year of commit activity
    Jupyter Notebook 32 MIT 7 1 4 Updated Dec 18, 2024
  • go-core Public

    Core and shared code for our go projects

    ydataai/go-core’s past year of commit activity
    Go 4 MIT 0 1 6 Updated Dec 18, 2024
  • ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    ydataai/ydata-profiling’s past year of commit activity
    Python 12,606 MIT 1,690 236 (39 issues need help) 24 Updated Dec 18, 2024
  • ydata-quality Public

    Data Quality assessment with one line of code

    ydataai/ydata-quality’s past year of commit activity
    Jupyter Notebook 429 MIT 55 19 (6 issues need help) 15 Updated Dec 17, 2024
  • swift-core Public

    Core functionality for Swift projects

    ydataai/swift-core’s past year of commit activity
    Swift 4 MIT 0 1 4 Updated Dec 16, 2024
  • aws-adapter Public

    AWS Adapter

    ydataai/aws-adapter’s past year of commit activity
    Go 0 0 1 3 Updated Dec 13, 2024
  • azure-adapter Public

    Azure Adapter

    ydataai/azure-adapter’s past year of commit activity
    Go 0 0 1 5 Updated Dec 13, 2024
  • authentication-service Public

    Handles authentication using OIDC flow

    ydataai/authentication-service’s past year of commit activity
    Go 2 MIT 0 1 9 Updated Dec 13, 2024
  • sketch-dask-extension Public

    Extension to support Sketch working with Dask Dataframes

    ydataai/sketch-dask-extension’s past year of commit activity
    Python 0 MIT 0 1 11 Updated Dec 11, 2024