OA’s GPT-f work on using GPT for MetaMath formal theorem-proving notes that they use the standard GPT-2 BPE but "preliminary experimental results demonstrate possible gains with specialized tokenization techniques."