Wednesday, September 3, 2025
HomeData Modelling & AIBig dataVespa Newsletter, June 2022

Vespa Newsletter, June 2022


In the previous update,
we mentioned tensor formats, grouping improvements, new query guides,
modular rank profiles and pyvespa docker image and deployments.
Today, we’re excited to share the following updates:

Vespa 8

Vespa 8 is released. Vespa is now on Java 17 and
CentOS Stream 8.
Read more about what this means for you in the blog post.

Pre/Post ANN filtering support

Approximate Nearest Neighbor is a popular feature in Vector Search applications, also supported in Vespa.
Vespa has integral support for combining ANN search with filters,
like “similar articles to this, in US market, not older than 14 days”.
From Vespa 7.586.113, users can configure whether to use pre- or post-filtering, with thresholds.
This enables a much better toolset to trade off precision with performance, i.e. balance cost and quality.
Read more in constrained-approximate-nearest-neighbor-search.

Fuzzy matching

Thanks to alexeyche, Vespa supports fuzzy query matching since 7.585 –
a user typing “spageti” will now match documents with “spaghetti”.
This is implemented using Levenshtein edit distance search –
e.g. one must make two “edits” (one-character changes) to make “spaghetti” from “spageti”.
Find the full contribution in #21689 and documentation at
query-language-reference.html#fuzzy.

Embedding support

A common technique in modern big data serving applications is to map the subject data – say, text or images –
to points in an abstract vector space and then do computation in that vector space.
For example, retrieve similar data by finding nearby points in the vector space,
or using the vectors as input to a neural net.
This mapping is usually referred to as embedding
read more about Vespa’s built-in support.

Tensors and ranking

fast-rank
enables ranking expression evaluation without de-serialization, to decrease latency, on the expense of more memory used.
Supported for tensor field types with at least one mapped dimension.

Tensor short format
is now supported in the /document/v1 API.

Support for importing onnx models in rank profiles is added.

Blog posts and training videos

Find great Vespa blog posts on
constrained ANN-search,
hybrid billion scale vector search,
and Lester Solbakken + Jo Kristian Bergum at the
Berlin Buzzwords conference
follow Jo Kristian for industry leading commentary.

New training videos for Vespa startup troubleshooting and auto document redistribution
are available at the YouTube channel @vespaai:


Vespa.ai: Troubleshooting startup, singlenode


Vespa.ai: Troubleshooting startup, multinode


Vespa.ai: Bucket distribution - intro

RELATED ARTICLES

Most Popular

Dominic
32260 POSTS0 COMMENTS
Milvus
81 POSTS0 COMMENTS
Nango Kala
6625 POSTS0 COMMENTS
Nicole Veronica
11795 POSTS0 COMMENTS
Nokonwaba Nkukhwana
11854 POSTS0 COMMENTS
Shaida Kate Naidoo
6746 POSTS0 COMMENTS
Ted Musemwa
7023 POSTS0 COMMENTS
Thapelo Manthata
6694 POSTS0 COMMENTS
Umr Jansen
6714 POSTS0 COMMENTS