Nostalgebraist talked about the serious weirdness of BPEs and how they improve chaotically centered on whitespace, capitalization, and context for GPT-2, with a followup article for GPT-3 on the even weirder encoding of quantities sans commas.
My page ::
Ads.Adcyprus.com