This is the official repository for the Kubernetes version of Nutanix GPT-in-a-Box.
Nutanix GPT-in-a-Box is a new turnkey solution that includes everything needed to build AI-ready infrastructure for organizations wanting to implement GPT capabilities while maintaining control of their data and applications.
This new solution includes:
- Software-defined Nutanix Cloud Platform™ infrastructure supporting GPU-enabled server nodes for seamless scaling of virtualized compute, storage, and networking supporting both traditional virtual machines and Kubernetes-orchestrated containers
- Files and Objects storage; to fine-tune and run a choice of GPT models
- Open source software to deploy and run AI workloads including PyTorch framework & KubeFlow MLOps platform
- The management interface for enhanced terminal UI or standard CLI
- Support for a curated set of LLMs including Llama2, Falcon and MPT
Refer to the official GPT-in-a-Box Documentation to deploy and validate the inference server on Kubernetes cluster
All source code and other contents in this repository are covered by the Nutanix License and Services Agreement, which is located at https://www.nutanix.com/legal/eula