apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
Apache Spark - A unified analytics engine for large-scale data processing
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Rocket Chip Generator
A flow-style query language for SQL engines
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
Modern Load Testing as Code
The Scala 3 compiler, also known as Dotty.
simona is an agent-based discrete-event power system simulation model developed @ie3-institute
♞ lichess.org: the forever free, adless and open source chess server ♞
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
CMAK is a tool for managing Apache Kafka clusters
State of the Art Natural Language Processing
workbench identity and access management
Scala language server with rich IDE features 🚀
ZIO — A type-safe, composable library for async and concurrent programming in Scala
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more