Skip to content

Constants

gllm_multimodal.constants

Constants for multimodal operations in Gen AI applications.

DEFAULT_ROUTER_EMBEDDING_MODEL = 'bedrock/us.twelvelabs.marengo-embed-2-7-v1:0' module-attribute

CaptionConstants

Constants for caption operations in Gen AI applications.

ATTACHMENTS_CONTEXT = 'attachments_context' class-attribute instance-attribute

CAPTION_DEFAULT_JSON_KEY = 'captions' class-attribute instance-attribute

DEFAULT_NUMBER_OF_CAPTIONS = 5 class-attribute instance-attribute

DOMAIN_KNOWLEDGE = 'domain_knowledge' class-attribute instance-attribute

IMAGE_DESCRIPTION = 'image_description' class-attribute instance-attribute

IMAGE_METADATA = 'image_metadata' class-attribute instance-attribute

IMAGE_ONE_LINER = 'image_one_liner' class-attribute instance-attribute

IS_STRUCTURED_CAPTION = 'is_structured_caption' class-attribute instance-attribute

NUMBER_OF_CAPTIONS = 'number_of_captions' class-attribute instance-attribute

CaptionOutputFormattingStrategy

Bases: StrEnum

Defines how output should be formatted.

STRUCTURED = 'structured' class-attribute instance-attribute

TEXT = 'text' class-attribute instance-attribute

ConverterResultTag

Bases: StrEnum

Defines converter result tags.

CAPTION = 'captions' class-attribute instance-attribute

MERMAID = 'mermaid' class-attribute instance-attribute

STRUCTURED_CAPTION = 'structured_caption' class-attribute instance-attribute

ExifConstants

Constants for EXIF tag operations in image metadata extraction.

GPS_LATITUDE = 'GPS GPSLatitude' class-attribute instance-attribute

GPS_LATITUDE_REF = 'GPS GPSLatitudeRef' class-attribute instance-attribute

GPS_LONGITUDE = 'GPS GPSLongitude' class-attribute instance-attribute

GPS_LONGITUDE_REF = 'GPS GPSLongitudeRef' class-attribute instance-attribute

GPS_SOUTH = 'S' class-attribute instance-attribute

GPS_WEST = 'W' class-attribute instance-attribute

LATITUDE = 'latitude' class-attribute instance-attribute

LONGITUDE = 'longitude' class-attribute instance-attribute

ImageToTextConstants

Constants for image-to-text operations in Gen AI applications.

FILENAME = 'filename' class-attribute instance-attribute

NOT_GIVEN = 'not_given' class-attribute instance-attribute

NO_IMAGE_FILENAME = 'no_image_filename' class-attribute instance-attribute

Modality

Bases: StrEnum

Defines supported modalities.

AUDIO = 'audio' class-attribute instance-attribute

IMAGE = 'image' class-attribute instance-attribute

TEXT = 'text' class-attribute instance-attribute

VIDEO = 'video' class-attribute instance-attribute

ModalityConverterApproach

Bases: StrEnum

Defines supported modality converter approach types.

GEMINI = 'gemini' class-attribute instance-attribute

GOOGLE_CLOUD = 'google_cloud' class-attribute instance-attribute

LM_BASED = 'lm_based' class-attribute instance-attribute

PROSA = 'prosa' class-attribute instance-attribute

WHISPER = 'whisper' class-attribute instance-attribute

YOUTUBE = 'youtube' class-attribute instance-attribute

ModalityConverterBuildStrategy

Bases: StrEnum

Defines how a modality converter should be constructed from its configuration.

KWARGS = 'kwargs' class-attribute instance-attribute

LMRP = 'lmrp' class-attribute instance-attribute

PRESET = 'preset' class-attribute instance-attribute

ModalityConverterTask

Bases: StrEnum

Defines supported modality converter tasks.

AUTO = 'auto' class-attribute instance-attribute

CAPTIONING = 'captioning' class-attribute instance-attribute

MERMAID = 'mermaid' class-attribute instance-attribute

TRANSCRIPT = 'transcript' class-attribute instance-attribute

ModalityTransformerRouterPreset

Bases: StrEnum

Defines supported modality transformer router presets.

DOMAIN_SPECIFIC = 'domain_specific' class-attribute instance-attribute

MULTIMODAL = 'multimodal' class-attribute instance-attribute

ModalityTransformerType

Bases: StrEnum

Defines supported modality transformer types.

GENERIC = 'generic' class-attribute instance-attribute

STANDARD = 'standard' class-attribute instance-attribute

WhisperResponseKeys

String keys used when parsing OpenAI Whisper API responses.

AUDIO_TRANSCRIPTS = 'audio_transcripts' class-attribute instance-attribute

LANGUAGE = 'language' class-attribute instance-attribute

SEGMENTS = 'segments' class-attribute instance-attribute

TEXT = 'text' class-attribute instance-attribute

WORDS = 'words' class-attribute instance-attribute