We've just announced IQ AI.
Chutes is a serverless compute platform designed for deploying, scaling, and running open-source artificial intelligence (AI) models. Developed by Rayon Labs, it operates on a decentralized, open-source infrastructure to provide AI inference and other computational services for developers and enterprises. [1] [2]
Chutes offers a serverless platform for developers to run open-source AI models without managing the underlying infrastructure. Its decentralized architecture is designed for scalability and processes trillions of tokens monthly. The platform simplifies access to high-performance AI computation by keeping popular models "permanently hot" for immediate, low-latency inference. It also provides the flexibility for users to deploy their own custom models. [1]
Chutes operates as a serverless compute layer for AI tasks. This model abstracts away server management, allowing developers to focus on their application code. The platform is responsible for allocating resources, scaling, and managing the execution environment for each job. [1]
The infrastructure is described as decentralized and open-source. This suggests a distributed network of compute resources rather than a centralized data center model. This design can contribute to resilience and potentially lower operational costs. The platform is engineered to handle various AI workloads, with a primary focus on model inference. [1]
A key technical feature is the "permanently hot" model system. By keeping frequently used AI models loaded and active, Chutes aims to minimize the cold-start delays often associated with serverless functions, making it suitable for real-time applications. The platform's team monitors for new open-source model releases and works to integrate them quickly, often making them available on the platform within a short time after their public release. [1]
Chutes offers a range of services centered around AI model execution and plans to expand its capabilities. [1]
The primary service is high-performance inference for a variety of AI models. Users can access these models via an API to integrate AI capabilities into their own applications. The platform provides analytics for monitoring usage and performance. [1]
Chutes supports a diverse set of AI model types, allowing for a wide array of applications. The platform categorizes its supported models into several groups:
This range of support indicates the platform's goal to be a comprehensive resource for various AI development needs. [1]
Chutes has announced several upcoming features to broaden its service offerings:
These planned additions suggest a strategy to serve a wider range of computational needs, from individual developers to enterprise clients with security-sensitive workloads. [1]
Chutes provides access to numerous open-source models from various research labs and companies. The platform highlights its ability to quickly host new and popular models. Some of the model providers featured on the platform include DeepSeek, Mistral AI, Microsoft, Google (Gemma), Qwen (Alibaba), and Moonshot AI (Kimi). Specific models available include variants of DeepSeek V3, Mistral Small, and NousResearch's DeepHermes. [1] [2]
The company's website lists several projects and companies that use or integrate with its services, including:
These integrations demonstrate the platform's adoption within the AI and decentralized technology ecosystems. [1]
Chutes is developed and operated by Rayon Labs. The platform's official X (formerly Twitter) account was created in November 2024, and it actively posts updates regarding new features, model availability, and pricing changes. [1] [2]
Chutes utilizes a subscription-based pricing model with several tiers, supplemented by a pay-as-you-go (PAYG) option for usage that exceeds plan limits. The pricing structure was updated in August 2025 to provide fixed monthly plans. [2]
The available subscription tiers are:
All paid tiers include unlimited API keys, access to all available models, and use of the Chutes Chat and Chutes Studio applications. The platform also offers a program for startups, providing up to $20,000 in free credits to eligible companies. [1]