Cog: Containers for machine learning#

Cog is an open-source tool that lets you package machine learning models in a standard, production-ready container.

You can deploy your packaged model to your own infrastructure, or to Replicate.

Highlights#

📦 Docker containers without the pain. Writing your own Dockerfile can be a bewildering process. With Cog, you define your environment with a simple configuration file and it generates a Docker image with all the best practices: Nvidia base images, efficient caching of dependencies, installing specific Python versions, sensible environment variable defaults, and so on.
🤬️ No more CUDA hell. Cog knows which CUDA/cuDNN/PyTorch/Tensorflow/Python combos are compatible and will set it all up correctly for you.
✅ Define the inputs and outputs for your model with standard Python. Then, Cog generates an OpenAPI schema and validates the inputs and outputs.
🎁 Automatic HTTP prediction server: Your model's types are used to dynamically generate a RESTful HTTP API using a high-performance Rust/Axum server.
🚀 Ready for production. Deploy your model anywhere that Docker images run. Your own infrastructure, or Replicate.

How it works#

Define the Docker environment your model runs in with cog.yaml:

build:
  gpu: true
  system_packages:
    - "libgl1-mesa-glx"
    - "libglib2.0-0"
  python_version: "3.13"
  python_requirements: requirements.txt
predict: "predict.py:Predictor"

Define how predictions are run on your model with predict.py:

from cog import BasePredictor, Input, Path
import torch

class Predictor(BasePredictor):
    def setup(self):
        """Load the model into memory to make running multiple predictions efficient"""
        self.model = torch.load("./weights.pth")

    # The arguments and types the model takes as input
    def predict(self,
          image: Path = Input(description="Grayscale input image")
    ) -> Path:
        """Run a single prediction on the model"""
        processed_image = preprocess(image)
        output = self.model(processed_image)
        return postprocess(output)

In the above we accept a path to the image as an input, and return a path to our transformed image after running it through our model.

Now, you can run predictions on this model:

$ cog predict -i image=@input.jpg
--> Building Docker image...
--> Running Prediction...
--> Output written to output.jpg

Or, build a Docker image for deployment:

$ cog build -t my-classification-model
--> Building Docker image...
--> Built my-classification-model:latest

$ docker run -d -p 5000:5000 --gpus all my-classification-model

$ curl http://localhost:5000/predictions -X POST \
    -H 'Content-Type: application/json' \
    -d '{"input": {"image": "https://.../input.jpg"}}'

Or, combine build and run via the serve command:

$ cog serve -p 8080

$ curl http://localhost:8080/predictions -X POST \
    -H 'Content-Type: application/json' \
    -d '{"input": {"image": "https://.../input.jpg"}}'

Why are we building this?#

It's really hard for researchers to ship machine learning models to production.

Part of the solution is Docker, but it is so complex to get it to work: Dockerfiles, pre-/post-processing, Flask servers, CUDA versions. More often than not the researcher has to sit down with an engineer to get the damn thing deployed.

Andreas and Ben created Cog. Andreas used to work at Spotify, where he built tools for building and deploying ML models with Docker. Ben worked at Docker, where he created Docker Compose.

We realized that, in addition to Spotify, other companies were also using Docker to build and deploy machine learning models. Uber and others have built similar systems. So, we're making an open source version so other people can do this too.

Hit us up if you're interested in using it or want to collaborate with us. We're on Discord or email us at [email protected].

Prerequisites#

macOS, Linux or Windows 11. Cog works on macOS, Linux and Windows 11 with WSL 2
Docker. Cog uses Docker to create a container for your model. You'll need to install Docker before you can run Cog. If you install Docker Engine instead of Docker Desktop, you will need to install Buildx as well.

Install#

If you're using macOS, you can install Cog using Homebrew:

brew install replicate/tap/cog

You can also download and install the latest release using our install script:

# bash, zsh, and other shells
sh <(curl -fsSL https://cog.run/install.sh)

# fish shell
sh (curl -fsSL https://cog.run/install.sh | psub)

# download with wget and run in a separate command
wget -qO- https://cog.run/install.sh
sh ./install.sh

You can manually install the latest release of Cog directly from GitHub by running the following commands in a terminal:

sudo curl -o /usr/local/bin/cog -L "https://github.com/replicate/cog/releases/latest/download/cog_$(uname -s)_$(uname -m)"
sudo chmod +x /usr/local/bin/cog

Or if you are on docker:

RUN sh -c "INSTALL_DIR=\"/usr/local/bin\" SUDO=\"\" $(curl -fsSL https://cog.run/install.sh)"

Upgrade#

If you're using macOS and you previously installed Cog with Homebrew, run the following:

brew upgrade replicate/tap/cog

Otherwise, you can upgrade to the latest version by running the same commands you used to install it.

Development#

See CONTRIBUTING.md for how to set up a development environment and build from source.

Next steps#

Get started with an example model
Get started with your own model
Using Cog with notebooks
Using Cog with Windows 11
Take a look at some examples of using Cog
Deploy models with Cog
cog.yaml reference to learn how to define your model's environment
Prediction interface reference to learn how the Predictor interface works
Training interface reference to learn how to add a fine-tuning API to your model
HTTP API reference to learn how to use the HTTP API that models serve

Need help?#

Join us in #cog on Discord.

Contributors ✨#

Thanks goes to these wonderful people (emoji key):

_{Ben Firshman} 💻 📖	_{Andreas Jansson} 💻 📖 🚧	_{Zeke Sikelianos} 💻 📖 🔧	_{Rory Byrne} 💻 📖 ⚠️	_{Michael Floering} 💻 📖 🤔	_{Ben Evans} 📖	_{shashank agarwal} 💻 📖
_VictorXLR 💻 📖 ⚠️	_{hung anna} 🐛	_{Brian Whitman} 🐛	_JimothyJohn 🐛	_ericguizzo 🐛	_{Dominic Baggott} 💻 ⚠️	_{Dashiell Stander} 🐛 💻 ⚠️
_{Shuwei Liang} 🐛 💬	_{Eric Allam} 🤔	_{Iván Perdomo} 🐛	_{Charles Frye} 📖	_{Luan Pham} 🐛 📖	_TommyDew 💻	_{Jesse Andrews} 💻 📖 ⚠️
_{Nick Stenning} 💻 📖 🎨 🚇 ⚠️	_{Justin Merrell} 📖	_{Rurik Ylä-Onnenvuori} 🐛	_Youka 🐛	_{Clay Mullis} 📖	_Mattt 💻 📖 🚇	_{Eng Zer Jun} ⚠️
_BB 💻	_williamluer 📖	_{Simon Eskildsen} 💻	_F 🐛 💻	_{Philip Potter} 🐛 💻	_{Joanne Chen} 📖	_technillogue 💻
_{Aron Carroll} 📖 💻 🤔	_{Bohdan Mykhailenko} 📖 🐛	_{Daniel Radu} 📖 🐛	_{Itay Etelis} 💻	_{Gennaro Schiano} 📖	_{André Knörig} 📖	_{Dan Fairs} 💻

This project follows the all-contributors specification. Contributions of any kind welcome!