Skip to content
Change the repository type filter

All

    Repositories list

    • Help augment diagnostic workflows with this Databricks Solution Accelerator for pathology image analysis. Now you can rapidly process thousands of whole slide images in minutes and use machine learning to automate the detection of metastasis.
      Python
      Other
      12000Updated Feb 3, 2025Feb 3, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      28k000Updated Feb 3, 2025Feb 3, 2025
    • Public runnable examples of using John Snow Labs' NLP for Apache Spark.
      Jupyter Notebook
      Apache License 2.0
      607000Updated Feb 3, 2025Feb 3, 2025
    • composer

      Public
      Supercharge Your Model Training
      Python
      Apache License 2.0
      431000Updated Feb 3, 2025Feb 3, 2025
    • dbdemos

      Public
      Demos to implement your Databricks Lakehouse
      HTML
      Other
      103000Updated Feb 3, 2025Feb 3, 2025
    • The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and enforces controls for the largest risks that customers ask about most often.
      HCL
      Other
      46000Updated Feb 3, 2025Feb 3, 2025
    • DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
      Shell
      MIT License
      482000Updated Feb 3, 2025Feb 3, 2025
    • LLM training code for MosaicML foundation models
      Python
      Apache License 2.0
      541000Updated Feb 3, 2025Feb 3, 2025
    • Bootstrap your large scale forecasting solution on Databricks with Many Models Forecasting (MMF)
      Python
      Other
      22000Updated Feb 3, 2025Feb 3, 2025
    • This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
      Python
      Apache License 2.0
      173000Updated Feb 3, 2025Feb 3, 2025
    • anomalib

      Public
      An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
      Python
      Apache License 2.0
      709000Updated Feb 3, 2025Feb 3, 2025
    • Security Analysis Tool (SAT) analyzes customer's Databricks account and workspace security configurations and provides recommendations that help them follow Databrick's security best practices. When a customer runs SAT, it will compare their workspace configurations against a set of security best practices and delivers a report.
      Python
      Other
      44000Updated Feb 3, 2025Feb 3, 2025
    • pixels

      Public
      Facilitates simple large scale processing of HLS Medical images, documents, zip files. Previously at https://github.com/dmoore247/pixels
      JavaScript
      Other
      18000Updated Feb 3, 2025Feb 3, 2025
    • State of the Art Natural Language Processing with John Snow Labs
      Scala
      Apache License 2.0
      718000Updated Feb 3, 2025Feb 3, 2025
    • This repo provides learning materials and production-ready code to build a high-quality RAG application using Databricks.
      Python
      Other
      91100Updated Feb 3, 2025Feb 3, 2025
    • LLM Bootcamp Series
      Python
      57000Updated Feb 3, 2025Feb 3, 2025
    • hls-tcga

      Public
      Load RNA expression profiles from TCGA and associated clinical data into the Databricks lakehouse platform, and subsequently perform diverse analyses on the dataset
      Python
      Other
      3000Updated Jan 21, 2025Jan 21, 2025
    • Notebooks for the Natural Language Processing with Transformers
      Jupyter Notebook
      Apache License 2.0
      1.3k000Updated Jan 21, 2025Jan 21, 2025
    • Burning Through Electronic Health Records in Real Time With Smolder
      Scala
      Other
      3000Updated Jan 21, 2025Jan 21, 2025
    • Use personalized images to enhance the output of an image generating model
      Python
      Other
      4000Updated Jan 21, 2025Jan 21, 2025
    • Media Mix Modeling Accelerator
      Python
      Other
      3000Updated Jan 21, 2025Jan 21, 2025
    • Low effort linking and easy de-duplication. Databricks ARC provides a simple, automated, lakehouse integrated entity resolution solution for intra and inter data linking.
      Python
      Other
      21000Updated Jan 21, 2025Jan 21, 2025
    • Gen AI application to estimate of the cost of payer treatment, service or procedure
      Python
      Other
      1000Updated Jan 21, 2025Jan 21, 2025
    • Examples of Databricks Asset Bundles
      Python
      Other
      39000Updated Jan 21, 2025Jan 21, 2025
    • Demonstrates how to use various generative AI forecasting models from within Databricks.
      Python
      Other
      7000Updated Jan 21, 2025Jan 21, 2025
    • This repository contains code example used and shared through Databricks Blog posts
      Python
      Other
      9000Updated Jan 21, 2025Jan 21, 2025
    • hub

      Public
      A library for transfer learning by reusing parts of TensorFlow models.
      Python
      Apache License 2.0
      1.7k000Updated Jan 21, 2025Jan 21, 2025
    • remorph

      Public
      Cross-compiler and Data Reconciler into Databricks Lakehouse
      Scala
      Other
      34000Updated Dec 29, 2024Dec 29, 2024
    • mosaic

      Public
      An extension to the Apache Spark framework that allows easy and fast processing of very large geospatial datasets.
      Jupyter Notebook
      Other
      70000Updated Nov 25, 2024Nov 25, 2024
    • Generative AI data curation and model patterns that take advantage of publicly available BioMedical articles.
      Python
      Other
      3000Updated Nov 25, 2024Nov 25, 2024