|
DocWire SDK
DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Supports nearly 100 data formats, including email boxes and OCR. Boost efficiency in text extraction, web data extraction, data mining, document analysis. Offline processing possible for security and confidentiality
|
Content type detection chain element. More...
#include <content_type.h>
Public Member Functions | |
| detector (ref_or_owned< by_signature::database > signatures_db_to_use=by_signature::database{}) | |
| Constructs a new detector with the given database of signatures. More... | |
| continuation | operator() (message_ptr msg, const message_callbacks &emit_message) override |
| bool | is_leaf () const override |
| Check if chain element is a leaf (last element which doesn't produce any messages). At this moment only exporters are leafs. More... | |
Public Member Functions inherited from docwire::chain_element | |
| chain_element (chain_element &&)=default | |
| chain_element & | operator= (chain_element &&)=default |
| virtual bool | is_generator () const |
Additional Inherited Members | |
Protected Types inherited from docwire::with_pimpl< chain_element > | |
| using | impl_type = pimpl_impl< chain_element > |
Protected Member Functions inherited from docwire::with_pimpl< chain_element > | |
| impl_type * | create_impl (Args &&... args) |
| with_pimpl (Args &&... args) | |
| with_pimpl (with_pimpl< chain_element > &&other) noexcept | |
| with_pimpl (std::nullptr_t) | |
| with_pimpl & | operator= (with_pimpl &&other) noexcept |
| impl_type & | impl () |
| const impl_type & | impl () const |
Content type detection chain element.
Detects and assigns content types to the provided data source using various detection strategies.
This class is a chain element that detects and assigns content types to data sources using the following detection methods:
Definition at line 106 of file content_type.h.
|
inline |
Constructs a new detector with the given database of signatures.
The detector will use the provided database of signatures for content type detection. If no database is provided, it will be created and loaded.
| signatures_db_to_use | The database of signatures to be used for content type detection. |
Definition at line 120 of file content_type.h.
|
inlineoverridevirtual |
Check if chain element is a leaf (last element which doesn't produce any messages). At this moment only exporters are leafs.
Implements docwire::chain_element.
Definition at line 145 of file content_type.h.