The purpose of using vectors is to improve quality, but that takes much more than a similarity lookup. Search engines are built for that, databases are not.
With Vespa, Vinted managed to halve the number of servers, slash query latency by 2.5x, indexing latency by 3x, and increase ranking depth by more than 3x.
Advances in Vespa features and performance include Optimized MaxSim with Hamming distance, IDE Support, Pyvespa features, and new notebooks with ColPali examples.
Advances in Vespa features and performance include new pyvespa and Vespa CLI features for queries, feeding, and deployment, shared match-phase heap, Chinese tokenization with Jieba, and new ranking framework features....