Vespa Product Updates, May 2021
In the previous update, we mentioned document/v1/, weakAnd.replace, Improved feed-block at full node and Reduced memory at stop/restart. Subscribe to the mailing list to get these updates delivered to your inbox.
This month, we’re excited to share the following updates:
Bfloat16 and int8 tensor value types
Since Vespa-7.396.22, bfloat16 and int8 are supported as tensor cell types. This makes it possible to serve larger models without increasing memory use, or to reduce cost by using lower-precision types (e.g. bfloat16 cells are 50% smaller than 32-bit float cells). Find details in the value type reference and learn more about performance considerations. For int8, one can use a compact hex-form string field to write indexed tensors representing binary data; see JSON feed format.
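To make the size trade-off concrete, here is a minimal Python sketch, not Vespa code. It relies on the fact that bfloat16 keeps the upper 16 bits of a 32-bit float (truncation is used here; real implementations may round instead), and it shows one plausible way to turn int8 cells into a hex string with two hex digits per cell. The function names are hypothetical, and the exact hex layout and casing Vespa expects are assumptions; check the JSON feed format reference before relying on it.

```python
import struct


def float_to_bfloat16_bits(x: float) -> int:
    """Return the 16-bit bfloat16 pattern for x by truncating its float32 form."""
    (bits32,) = struct.unpack(">I", struct.pack(">f", x))
    # Keep the sign bit, the 8 exponent bits and the top 7 mantissa bits.
    return bits32 >> 16


def bfloat16_bits_to_float(bits16: int) -> float:
    """Expand a bfloat16 bit pattern back to a Python float."""
    (x,) = struct.unpack(">f", struct.pack(">I", bits16 << 16))
    return x


def int8_cells_to_hex(cells) -> str:
    """Encode signed int8 cells as two hex digits each (two's complement).

    Assumption: this mirrors a compact hex form; the casing and layout
    Vespa accepts are not confirmed here.
    """
    parts = []
    for c in cells:
        if not -128 <= c <= 127:
            raise ValueError(f"{c} does not fit in int8")
        parts.append(format(c & 0xFF, "02X"))
    return "".join(parts)


def tensor_bytes(num_cells: int, bits_per_cell: int) -> int:
    """Raw cell storage in bytes, ignoring any per-tensor overhead."""
    return num_cells * bits_per_cell // 8
```

For example, tensor_bytes(1000, 16) is half of tensor_bytes(1000, 32), which is the 50% saving mentioned above, and a round trip through bfloat16 keeps 1.0 exact while values needing more than 7 mantissa bits lose precision.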
Case-sensitive attribute search
Search in string attributes is by default done in word match mode. This means that the attribute’s value is stored unchanged, and matching lowercases query terms and attribute data on the fly, with a few heuristics to filter out punctuation. Some use cases, e.g. ID lookup, require case-sensitive matching. This is enabled by using the cased match mode, available since Vespa-7.397.65. Read more
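The difference between the two modes can be illustrated with a small Python sketch. It is a simplification under stated assumptions: the function names are hypothetical, and the punctuation heuristics of word match mode are left out entirely.

```python
def uncased_match(query_term: str, attribute_value: str) -> bool:
    """Simplified default behavior: lowercase both sides before comparing."""
    return query_term.lower() == attribute_value.lower()


def cased_match(query_term: str, attribute_value: str) -> bool:
    """Simplified cased behavior: compare the stored value as-is."""
    return query_term == attribute_value
```

With IDs such as "AbC123" and "abc123", uncased_match treats them as the same term while cased_match keeps them distinct, which is what an ID lookup usually needs.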
Attributes with hashed dictionary
Attributes are in-memory fields, in their simplest form stored in a table-like data structure. By using fast-search, one can speed up lookups by adding a dictionary, by default a b-tree, to avoid a full table scan. As of Vespa-7.397.65, a hash-based dictionary can be configured. It is intended for fields with many unique terms that each have few occurrences (i.e. short posting lists), where the dictionary lookup cost is significant. Combine with cased match mode for best performance. Read more
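The three lookup strategies described above can be sketched in Python. This is an illustration only, not how Vespa implements attributes: a sorted list searched with bisect stands in for the b-tree, a plain dict stands in for the hash dictionary, and all class and function names are hypothetical.

```python
import bisect


def full_scan(values, term):
    """No dictionary: check every document's value, O(number of documents)."""
    return [doc_id for doc_id, value in enumerate(values) if value == term]


def build_postings(values):
    """Map each unique term to the list of document ids that contain it."""
    postings = {}
    for doc_id, value in enumerate(values):
        postings.setdefault(value, []).append(doc_id)
    return postings


class SortedDictionary:
    """Ordered dictionary (stand-in for a b-tree): O(log n) term lookup."""

    def __init__(self, postings):
        self._terms = sorted(postings)
        self._lists = [postings[t] for t in self._terms]

    def lookup(self, term):
        i = bisect.bisect_left(self._terms, term)
        if i < len(self._terms) and self._terms[i] == term:
            return self._lists[i]
        return []


class HashDictionary:
    """Hash-based dictionary: expected O(1) term lookup, no ordering."""

    def __init__(self, postings):
        self._postings = dict(postings)

    def lookup(self, term):
        return self._postings.get(term, [])
```

When posting lists are short, as with mostly unique IDs, the time spent finding the term dominates the time spent reading its posting list, so the cheaper hash lookup is where the gain comes from.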
Hamming distance metric for ANN search
Since Vespa-7.401.18, a hamming distance metric for tensor<int8> cell types is supported for Approximate Nearest Neighbor Search using Vespa’s nearestNeighbor query operator.
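This distance metric counts the bit positions in which two vectors differ, which makes it useful for comparing compact binary representations such as binarized embeddings or hash codes. Note that it is not an edit distance: both vectors must have the same length, and insertions or deletions are not modeled.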
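As a rough illustration of the metric itself, the following Python sketch computes a Hamming distance over int8 cells by counting differing bits, and uses it in an exact brute-force nearest neighbor search. The bit-wise interpretation of int8 cells is an assumption stated here, the function names are hypothetical, and the brute-force loop is only a reference point: the nearestNeighbor operator performs an approximate search rather than this exhaustive one.

```python
def hamming_distance(a, b) -> int:
    """Count differing bits between two equal-length sequences of int8 values."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    total = 0
    for x, y in zip(a, b):
        # XOR marks differing bits; mask to 8 bits so negative values
        # are treated as two's complement int8.
        total += bin((x ^ y) & 0xFF).count("1")
    return total


def nearest(query, candidates):
    """Exact brute-force search: index of the candidate closest to query."""
    return min(range(len(candidates)),
               key=lambda i: hamming_distance(query, candidates[i]))
```

For instance, the single-cell vectors [0] and [-1] differ in all 8 bits, while [1, 0] and [3, 0] differ in exactly one.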
About Vespa: Largely developed by Yahoo engineers, Vespa is an open source big data processing and serving engine. It’s in use by many products, such as Yahoo News, Yahoo Sports, Yahoo Finance, and the Verizon Media Ad Platform. Thanks to feedback and contributions from the community, Vespa continues to grow.