OA’s GPT-f do the job on employing GPT for MetaMath formal theorem-proving notes that they use the regular GPT-2 BPE but "preliminary experimental outcomes display achievable gains with specialized tokenization tactics.
Review my web site;
webcam sites