Oryx 2 is a realization of the lambda architecture built on Apache Spark and Apache Kafka, specialized for real-time, large-scale machine learning. It is a framework for building applications, and it also includes packaged, end-to-end applications for collaborative filtering, classification, regression and clustering.
Proceed to the Oryx 2 site for full documentation.
Just looking to deploy a ready-made, end-to-end application for collaborative filtering, clustering or classification? Easy. Proceed directly to:
- Prepare your Hadoop cluster with Cluster Setup
- Get a Release
- Prepare a config file from the Configuration Reference
- Run the binaries with Running Oryx
- Learn about the REST API endpoints you can call in the API Endpoint Reference
Developers can also use Oryx 2 as a framework for building custom applications. After reading the architecture overview below, proceed to Making an Oryx App to learn how to create a new application. You can also review the module diagram to understand the project structure.
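For readers new to the lambda architecture, a toy sketch of the general pattern may help before reading the overview. This is not Oryx code and uses no Oryx, Spark or Kafka APIs: the class names (`BatchLayer`, `SpeedLayer`, `ServingLayer`), the `demo` function and the trivial "model" (a table of counts) are all hypothetical, and plain Python lists stand in for the message topics a real deployment would use.

```python
from collections import Counter


class BatchLayer:
    """Rebuilds a complete model from all historical input (hypothetical)."""

    def build_model(self, history):
        # The "model" here is just a count per item, as a stand-in
        # for a real machine learning model.
        return Counter(history)


class SpeedLayer:
    """Turns each newly arrived input into a small model update (hypothetical)."""

    def update_for(self, event):
        # An update is a (key, delta) pair to be applied to the current model.
        return (event, 1)


class ServingLayer:
    """Holds the current model in memory and answers queries (hypothetical)."""

    def __init__(self):
        self.model = Counter()

    def load_model(self, model):
        # Replace the in-memory model with a freshly built batch model.
        self.model = Counter(model)

    def apply_update(self, update):
        # Fold an incremental update into the in-memory model.
        key, delta = update
        self.model[key] += delta

    def count(self, key):
        return self.model[key]


def demo():
    history = ["a", "b", "a"]  # input already seen by the batch layer
    recent = ["a", "c"]        # input that arrived after the last batch run

    serving = ServingLayer()
    serving.load_model(BatchLayer().build_model(history))

    speed = SpeedLayer()
    for event in recent:
        serving.apply_update(speed.update_for(event))

    return {key: serving.count(key) for key in ("a", "b", "c")}
```

Calling `demo()` shows the serving layer answering from a batch-built model that has been refreshed by incremental updates, which is the division of labor the three layers are meant to provide.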