2024

July

Finetune Llama 3.1 on Your Infra

Zhanghao Wu, Romil Bhardwaj, Zongheng Yang·Jul 23, 2024·5 min read

AI on Kubernetes Without the Pain

Romil Bhardwaj·Jul 11, 2024·12 min read

June

SkyPilot 0.6: Managed Jobs API, SkyServe on Kubernetes, Spot + On-demand mixing, Paperspace support

SkyPilot Team·Jun 4, 2024·4 min read

February

Introducing SkyServe: 50% Cheaper AI Serving on Any Cloud with High Availability

Tian Xia, Zhanghao Wu, Ziming Mao, Zongheng Yang·Feb 20, 2024·10 min read

2023

December

Scaling Mixtral LLM Serving with High GPU Availability and Cost Efficiency

Zhanghao Wu·Dec 21, 2023·8 min read

September

Scaling AI Robotics on the Cloud

Rocky Duan (CTO, Covariant), Clay Rosenthal (Production Engineer, Covariant), Marco Almeida (TLM of Production Engineering Team, Covariant), Chris Colby (Head of Software and Research, Covariant)·Sep 26, 2023·10 min read

August

Finetuning Llama 2 in your own cloud environment, privately

Zhanghao Wu, Wei-Lin Chiang, Zongheng Yang·Aug 2, 2023·12 min read

June

Serving LLM 24x Faster On the Cloud with vLLM and SkyPilot

Woosuk Kwon, Zhuohan Li, Zhanghao Wu·Jun 29, 2023·5 min read

May

SkyPilot 0.3: LLM support and unprecedented GPU availability across more clouds

SkyPilot Team·May 30, 2023·6 min read

Analyzing the Whole Mouse Brain Atlas on the Cloud With SkyPilot [User Post]

Hanqing Liu·May 1, 2023·12 min read

March

Run LLaMA LLM chatbots on any cloud with one click

Woosuk Kwon, Zongheng Yang·Mar 20, 2023·7 min read

2022

November

SkyPilot: ML and Data Science on any cloud with massive cost savings

Zongheng Yang, Ion Stoica·Nov 16, 2022·9 min read