|
DocWire SDK
DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Supports nearly 100 data formats, including email boxes and OCR. Boost efficiency in text extraction, web data extraction, data mining, document analysis. Offline processing possible for security and confidentiality
|
A chain element that generates embeddings for input text using a local AI model. More...
#include <local_ai_embed.h>
Public Member Functions | |
| embed (std::shared_ptr< model_runner > model_runner, std::string prefix) | |
| Construct a local AI embed chain element with a specific model runner and prefix. More... | |
| embed (std::string prefix) | |
| Construct a local AI embed chain element with a default model runner and prefix. More... | |
| continuation | operator() (message_ptr msg, const message_callbacks &emit_message) override |
| bool | is_leaf () const override |
| Check if chain element is a leaf (last element which doesn't produce any messages). At this moment only exporters are leafs. More... | |
Public Member Functions inherited from docwire::chain_element | |
| chain_element (chain_element &&)=default | |
| chain_element & | operator= (chain_element &&)=default |
| virtual bool | is_generator () const |
Static Public Attributes | |
| static const std::string | e5_passage_prefix |
| Common prefix for passage embeddings with E5 models. | |
| static const std::string | e5_query_prefix |
| Common prefix for query embeddings with E5 models. | |
Additional Inherited Members | |
Protected Types inherited from docwire::with_pimpl< chain_element > | |
| using | impl_type = pimpl_impl< chain_element > |
Protected Types inherited from docwire::with_pimpl< embed > | |
| using | impl_type = pimpl_impl< embed > |
Protected Member Functions inherited from docwire::with_pimpl< chain_element > | |
| impl_type * | create_impl (Args &&... args) |
| with_pimpl (Args &&... args) | |
| with_pimpl (with_pimpl< chain_element > &&other) noexcept | |
| with_pimpl (std::nullptr_t) | |
| with_pimpl & | operator= (with_pimpl &&other) noexcept |
| impl_type & | impl () |
| const impl_type & | impl () const |
Protected Member Functions inherited from docwire::with_pimpl< embed > | |
| impl_type * | create_impl (Args &&... args) |
| with_pimpl (Args &&... args) | |
| with_pimpl (with_pimpl< embed > &&other) noexcept | |
| with_pimpl (std::nullptr_t) | |
| with_pimpl & | operator= (with_pimpl &&other) noexcept |
| impl_type & | impl () |
| const impl_type & | impl () const |
A chain element that generates embeddings for input text using a local AI model.
This class is a chain element that takes a model_runner to generate a vector embedding for a given text. It is designed to work with sentence-transformer models like multilingual-e5-small.
Definition at line 31 of file local_ai_embed.h.
|
explicit |
Construct a local AI embed chain element with a specific model runner and prefix.
| model_runner | The model runner to use for generating embeddings. |
| prefix | The string to prepend to the input text. Use an empty string for no prefix. |
|
explicit |
Construct a local AI embed chain element with a default model runner and prefix.
This constructor initializes the embedder with a default model_runner configured to use the multilingual-e5-small-ct2-int8 model.
| prefix | The string to prepend to the input text. Use an empty string for no prefix. |
|
inlineoverridevirtual |
Check if chain element is a leaf (last element which doesn't produce any messages). At this moment only exporters are leafs.
Implements docwire::chain_element.
Definition at line 58 of file local_ai_embed.h.