Vespa Blog

AI + data, online

All Stories

Using approximate nearest neighbor search in real world applications

From text search and recommendation to ads and online dating, ANN search rarely works in isolation.

Lester Solbakken
18 Dec 2020

Vespa Product Updates, December 2020

Advances in Vespa features and performance include improved tensor ranking performance, Apache ZooKeeper integration, a Vespa Python API for researchers, and ONNX integration.

Kristian Aune
17 Dec 2020

Stateful model serving: how we accelerate inference using ONNX Runtime

There's a difference between stateless and stateful model serving.

Lester Solbakken
14 Dec 2020

Fine-tuning a BERT model for search applications

How to ensure training and serving encoding compatibility.

Thiago Martins
25 Nov 2020

From research to production: scaling a state-of-the-art machine learning system

How we implemented a production-ready question-answering application and reduced response time by more than two orders of magnitude.

Lester Solbakken
12 Nov 2020

Fine-tuning a BERT model with transformers

Set up a custom Dataset, fine-tune BERT with the Transformers Trainer, and export the model via ONNX.

Thiago Martins
12 Nov 2020

Vespa Product Updates, October 2020

Improvements to the Vespa feeding APIs

Kristian Aune
30 Oct 2020

Vespa Product Updates, September 2020

Introducing ONNX-Runtime, Hamming Distance Metric, Conditional Update Performance Improvements and Compressed Transaction Log with Synced Ack

Kristian Aune
01 Oct 2020

Efficient open-domain question-answering on Vespa.ai

In this post, we reproduce the state-of-the-art baseline for retrieval-based question-answering systems within a single, scalable, production-ready application on Vespa.ai.

Lester Solbakken
30 Sep 2020

Vespa Product Updates, August 2020

Introducing NLP with Transformers, Grafana how-to, Improved GEO Search Support, Query Profile Variants Optimizations, & Build on Debian 10

Kristian Aune
27 Aug 2020

Vespa Product Updates, June 2020

Announcing support for approximate nearest neighbor vector search, which can be combined with filters and text search and delivers state-of-the-art performance

Kristian Aune
05 Jul 2020

Introducing NLP with Transformers on Vespa

We’ve been working a lot lately on evaluating Transformer models in Vespa. Here we show how and share a bit on how we view the benefits of inference in Vespa....

Lester Solbakken
02 Jul 2020
