Live Training for Open-source Big Models


What’s New

  • 2022/05/25 The training plan for CPM-Live is now published. Look forward to the training!

Training Plan

Considering the scale of data and computing resources, CPM-Live will start with a 10B model training, which we named CPM-Ant. The training of CPM-Ant will start on May 29, 2022, and the entire training process is expected to last five months.

During training we will do:

  • Real-time: Display model training metrics
  • Every day: Release the model training log
  • Every week: Deal with discussions and feedback from the community
  • Irregularly: Release checkpoints during model training which everyone can download

During training you can:

  • Raise your model proposal: Have better ideas on model architecture, training methods, or data sources? You can put forward your model proposal in the community. If the proposal receives more support and is practically feasible, we will add it to the model we are training, so that CPM-Live can learn continuously and progress with the help of everyone.

  • Develop your application: You can submit your initial ideas, prototypes, development code, or finished apps, which are based on CPM-Live, to the community. We will exhibit the most popular apps on the website.

  • Chat on the forum: You can talk about anything related to big models in our forums, such as academic research, engineering implementation, tool use, application design, etc. No matter whether you are experienced or not, we believe everyone can benefit from positive and open discussions.

  • Download the resource: Once the model training is complete, you are free to download the model parameters under an open use license. CPM-Live uses an open license that includes permission for commercialization. With model compression and inference acceleration tools, you can experience the power of big models on your own PC!


Our community is based on GitHub Discussions.

Read the first post and start your exploration on CPM-Live!


View Github