GPT 4o like bot.
Experiment with and compare different tokenizers
Efficient quantized retrieval over Wikipedia