Welcome to Ducho’s documentation!

What is Ducho

Ducho v2.0 is a Python framework for extracting multimodal features for recommendation. It provides a unified interface to most of the common deep learning libraries (e.g., TensorFlow, PyTorch, Transformers, Sentence-Transformers) to extract high-level features from items (e.g., product images/descriptions) and user-item interactions (e.g., user reviews). It is highly configurable through a YAML-based configuration file, whose values may be overridden by command-line arguments. Users can specify the source from which to extract the multimodal features (i.e., items/interactions), the modalities (i.e., visual/textual/audio/multiple), and the list of models, along with the output layers and preprocessing steps used to extract the features. Moreover, with the new version of Ducho, users can run extractions with their own pretrained models.
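
The idea of a configuration file whose values can be overridden from the command line can be illustrated with a minimal sketch. This is not Ducho's own code or its actual override syntax: the configuration keys, the `apply_overrides` helper, and the dotted `key=value` format are all assumptions made for illustration. The nested dictionary stands in for a parsed YAML file organized by modality, source, and model list.

```python
import copy

# Hypothetical configuration, shaped like a parsed YAML file
# (modality -> source -> models). All key names are illustrative assumptions.
BASE_CONFIG = {
    "dataset_path": "./data/demo",
    "visual": {
        "items": {
            "input_path": "images",
            "output_path": "visual_embeddings",
            "model": [
                {
                    "model_name": "ResNet50",
                    "output_layers": "avgpool",
                    "backend": "torch",
                },
            ],
        }
    },
}


def apply_overrides(config, overrides):
    """Return a copy of `config` with 'a.b.c=value' overrides applied.

    Hypothetical helper: the dotted-key syntax is an assumption,
    not Ducho's documented command-line format.
    """
    result = copy.deepcopy(config)  # leave the base configuration untouched
    for item in overrides:
        dotted_key, _, value = item.partition("=")
        keys = dotted_key.split(".")
        node = result
        # Walk (or create) the intermediate dictionaries.
        for key in keys[:-1]:
            node = node.setdefault(key, {})
        node[keys[-1]] = value
    return result


# Overrides as they might arrive from the command line.
merged = apply_overrides(
    BASE_CONFIG,
    ["visual.items.output_path=custom_embeddings", "textual.items.input_path=descriptions"],
)
```

Copying the base configuration before applying overrides keeps the file-derived values available for comparison or logging, and values given on the command line take precedence only in the merged result.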

system schema

Backends-Modalities available settings in Ducho v2.0

Table columns: Sources (Items, Interactions) and Backends (TensorFlow, PyTorch, Transformers, Sentence-Transformers).

Audio: 4 of the 6 columns checked

Visual: 5 of the 6 columns checked

Text: 4 of the 6 columns checked

Visual-Textual: 3 of the 6 columns checked
