magiCARP: Contrastive Authoring+Reviewing Pretraining
Welcome to the magiCARP API, the test bed used by EleutherAI for performing text/text bi-encoder experiments.
CARP, or contrastive authorship+reviewing pairings, was first outlined in Cut the CARP: Fishing for zero-shot story evaluation.
CARP presents a scalable method for performing zero-shot evaluation of stories and other mediums. Current CARP efforts at EleutherAI are primarily focused around controllable code generation. This repository will be updated with more experiments over the coming months as we try varying CARP architectures.
To train a model, run
poetry run python -m carp.pytorch.train --data_path="carp/dataset" --config_path ./base_config.yml
Finetuning via COOP and preference learning coming soon.