Understanding Embeddings: A Short Guide with an Example

Embeddings are an invisible, yet important part of many technologies we encounter. From internet search engines, through recommendation systems and advertisement personalization, to advanced analyses of images, videos, and technologies for speech and sound recognition – embeddings play a key role everywhere. In this article, we will explain how embeddings work and how they facilitate, and enrich our daily experiences with technology.

As I mentioned above, vector embeddings are a popular technique to represent information in a format (typically as a vector of numerical values) that can be easily processed by algorithms, especially deep learning models. This ‘information’ can be text, pictures, video, and audio.

For example, the conversion of the word ‘dog’ into a numerical vector representation could look like this:

What is the embedding dimension?

A crucial factor in determining the quality and effectiveness of the embedding is the embedding dimension. Generally, the term ‘dimensionality of word embedding’ refers to the total count of dimensions used to define a word’s vector representation. This number is usually established during the development of the word embedding and indicates how many distinct features are included in the vector representation of the word.

For text embeddings, these vectors are constructed in a way that captures the semantic meaning of the text. This ensures that words or sentences conveying similar meanings are close to each other in the embedded space, often referred to as a vector space.

What does that mean? Here’s a simple example

Let’s say we have a space with only two dimensions – [x, y], where x represents sex and y represents activity.

Now, for example, by asking the question ‘Who is walking?’ the embeddings will search for vectors that include the ‘Walk’ sentence in the y dimension.

We can observe that the sentence “Walk” is associated with a woman, a man, a boy, and a girl. This means that all of them are walking. And thus, we get our answer: A woman, a man, a boy, and a girl are walking.

Remember, this is a very simplified example with only two dimensions of meaning. In reality, the more such dimensions there are, the better our embedding is at guessing the meanings of saved sentences as words or phrases.

Ready for more embedding tutorials?

Delve deeper into various applications of embedding in AI:

Retrieval Augmented Generation (RAG) in Machine Learning Explained

By Paweł Polak, Fullstack Developer @ bright inventions.

Stiri similare

Chicago woman charged with biting cop at Hammond Walmart

Así ha sido el último punto de Nadal en el Mutua Madrid Open y sus partidos contra Djokovic y Federer en la Caja Mágica

Daily News boys athlete of the week: Dylan Volantis, Westlake

The Cheyenne Supercomputer is going for a fraction of its list price at auction right now

City celebrates townhome transformation in Nob Hill

Top battleground Senate race heats up as party-backed Republican faces onslaught from former Trump official

Understanding Embeddings: A Short Guide with an Example

What is the embedding dimension?

What does that mean? Here’s a simple example

Ready for more embedding tutorials?

Related

Leave a Reply Cancel reply

What is the embedding dimension?

What does that mean? Here’s a simple example

Ready for more embedding tutorials?

Share on:

Related

Leave a Reply Cancel reply

Stiri similare