ByteBrief
Skimming the internet so you don't have to
Token-count-based Batching: Faster, Cheaper Embedding Inference for Queries | ByteBrief