The big data serving engine - Store, search, rank and organize big data at user serving time. Vespa is an engine for low-latency computation over large data sets. It stores and indexes your data and executes distributed queries including evaluation of machine-learned models over many data points in real time.
Use cases such as search, recommendation and personalization need to select a subset of data in a large corpus, evaluate machine-learned models over the selected data, organize and aggregate it and return it, typically in less than 100 milliseconds, all while the data corpus is continuously changing.
This is hard to do, especially with large corpora that need to be distributed over multiple nodes and evaluated in parallel. Vespa is a platform which performs these operations for you. It has been in development for many years and is used by a number of large internet services and apps which serve hundreds of thousands of queries from Vespa per second.
To get started using Vespa, pick one of the quick start documents:
- Run on a Mac or Linux machine using Docker
- Run on a Windows machine using Docker
- Run on a Mac or Linux machine using VirtualBox+Vagrant
- Multinode install on AWS EC2
- Multinode install on AWS ECS
- The application created in the quickstart is fully functional and production ready, but you may want to add more nodes for redundancy.
- Try the Blog search and recommendation tutorial to learn more about using Vespa
- See Developing applications to learn how to add your own Java components to your Vespa application.
- See Vespa APIs to understand how to interface with Vespa.
- Explore the sample applications
Full documentation is available at https://docs.vespa.ai.
We welcome contributions! See CONTRIBUTING.md to learn how to contribute.
If you want to contribute to the documentation, see https://github.com/vespa-engine/documentation.
You do not need to build Vespa to use it, but if you want to contribute you need to be able to build the code. This section explains how to build and test Vespa. To understand where to make changes, see Code-map.md. Some suggested improvements with pointers to code are in TODO.md.
Building the C++ and Java code is supported on CentOS 7. The Java source can also be built on any platform that has Java 11 and Maven installed. Use the following guide to set up a complete development environment using Docker for building Vespa, running unit tests and running system tests: Vespa development on CentOS 7.
export MAVEN_OPTS="-Xms128m -Xmx1024m"
./bootstrap.sh java
mvn install --threads 1C
Use this if you only need to build the Java modules; otherwise, follow the complete development guide above.
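Since the Java-only build depends on Java 11 being installed, it can be useful to check the Java version before starting a long Maven run. The following is a minimal sketch of such a check, not part of the Vespa repository: the function names `java_major_version` and `is_java_11_or_newer` are hypothetical, and treating versions newer than 11 as acceptable is an assumption, since this document only mentions Java 11.

```shell
#!/bin/sh
# Hypothetical helpers (not part of the Vespa repository).

# Print the major version for a Java version string such as
# "11.0.2", "17" or the legacy form "1.8.0_292".
java_major_version() {
    version="$1"
    case "$version" in
        1.*)
            # Legacy scheme: "1.8.0_292" denotes Java 8, so drop the "1." prefix.
            version="${version#1.}"
            ;;
    esac
    # Keep only the digits before the first non-digit character.
    echo "${version%%[!0-9]*}"
}

# Succeed if the version string denotes Java 11 or newer.
# Assumption: newer releases are acceptable; the text above only names Java 11.
is_java_11_or_newer() {
    major="$(java_major_version "$1")"
    [ -n "$major" ] && [ "$major" -ge 11 ]
}
```

One way to use it is to extract the version string from the output of `java -version` (which is written to standard error) and pass it to `is_java_11_or_newer`, stopping with a message before `./bootstrap.sh java` if the check fails.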
See Building Vespa RPM for details.
The code is licensed under the Apache 2.0 license. See LICENSE for terms.