Spaces:
Running
AWS EC2 Basic Deployment
This is an example deployment to AWS EC2 Compute using terraform.
This deployment will do the following:
- Create a security group with required ports open (22 and 8000)
- Create EC2 instance with Ubuntu 22 and deploy Chroma using docker compose
- Create a data volume for Chroma data
- Mount the data volume to the EC2 instance
- Format the data volume with ext4
- Start Chroma
Requirements
Deployment with terraform
This deployment uses Ubuntu 22 as foundation, but you'd like to use a different AMI (non-Debian based image) you may have to adjust the startup script.
To find AWS EC2 AMIs you can use:
# 099720109477 is Canonical
aws ec2 describe-images \
--owners 099720109477 \
--filters 'Name=name,Values=ubuntu/images/*/ubuntu-jammy*' \
--query 'sort_by(Images,&CreationDate)[-1].ImageId'
2. Init your terraform state
terraform init
3. Deploy your application
Generate SSH key to use with your chroma instance (so you can login to the EC2):
Note: This is optional. You can use your own existing SSH key if you prefer.
ssh-keygen -t RSA -b 4096 -C "Chroma AWS Key" -N "" -f ./chroma-aws && chmod 400 ./chroma-aws
Set up your Terraform variables and deploy your instance:
#AWS access key
export TF_VAR_AWS_ACCESS_KEY=<AWS_ACCESS_KEY>
#AWS secret access key
export TF_VAR_AWS_SECRET_ACCESS_KEY=<AWS_SECRET_ACCESS_KEY>
#path to the public key you generated above (or can be different if you want to use your own key)
export TF_ssh_public_key="./chroma-aws.pub"
#path to the private key you generated above (or can be different if you want to use your own key) - used for formatting the Chroma data volume
export TF_ssh_private_key="./chroma-aws"
#set the chroma release to deploy
export TF_VAR_chroma_release=0.4.12
# AWS region to deploy the chroma instance to
export TF_VAR_region="us-west-1"
#enable public access to the chroma instance on port 8000
export TF_VAR_public_access="true"
#enable basic auth for the chroma instance
export TF_VAR_enable_auth="true"
#The auth type to use for the chroma instance (token or basic)
export TF_VAR_auth_type="token"
#optional - if you want to restore from a snapshot
export TF_VAR_chroma_data_restore_from_snapshot_id=""
#optional - if you want to snapshot the data volume before destroying the instance
export TF_VAR_chroma_data_volume_snapshot_before_destroy="true"
terraform apply -auto-approve
Note: Basic Auth is supported by Chroma v0.4.7+
4. Check your public IP and that Chroma is running
Get the public IP of your instance
terraform output instance_public_ip
Check that chroma is running (It should take up several minutes for the instance to be ready)
export instance_public_ip=$(terraform output instance_public_ip | sed 's/"//g')
curl -v http://$instance_public_ip:8000/api/v1/heartbeat
4.1 Checking Auth
Token
When token auth is enabled you can check the get the credentials from Terraform state by running:
terraform output chroma_auth_token
You should see something of the form:
PVcQ4qUUnmahXwUgAf3UuYZoMlos6MnF
You can then export these credentials:
export CHROMA_AUTH=$(terraform output chroma_auth_token | sed 's/"//g')
Using the credentials:
curl -v http://$instance_public_ip:8000/api/v1/collections -H "Authorization: Bearer ${CHROMA_AUTH}"
Basic
When basic auth is enabled you can check the get the credentials from Terraform state by running:
terraform output chroma_auth_basic
You should see something of the form:
chroma:VuA8I}QyNrm0@QLq
You can then export these credentials:
export CHROMA_AUTH=$(terraform output chroma_auth_basic | sed 's/"//g')
Using the credentials:
curl -v http://$instance_public_ip:8000/api/v1/collections -u "${CHROMA_AUTH}"
Note: Without
-u
you should be getting 401 Unauthorized response
4.2 Connect (ssh) to your instance
To SSH to your instance:
ssh -i ./chroma-aws ubuntu@$instance_public_ip
5. Destroy your Chroma instance
You will need to change prevent_destroy
to false
in the aws_ebs_volume
in chroma.tf
.
terraform destroy -auto-approve
Extras
You can visualize your infrastructure with:
terraform graph | dot -Tsvg > graph.svg
Note: You will need graphviz installed for this to work