The Features of AVOBMAT

textmining

Text and data mining large library & research databases

content analysis

Content analyis

Metadata analysis and visualization

Interactive metadata analysis & visualizations

preprocessing options

18 Preprocessing options

Topic modeling, correlations, visualizations

Network analysis

Network analysis

21 available languages

N-gram viewer

Metadata

Over 200 metadata fields

Individual configuration

Individual configuration of all analytical tools

TagSpheres (context of a word)

Metadata enrichment

Google Drive import

TEI XML import

Significant text analysis

Significant text analysis (comparing corpora)

language detection

Automatic language detection

search modes

Fast advanced, fuzzy, proximity and commandline searches

Keyword in Context / Concordance

gender analysis

Gender analysis of authors

Named Entity Recognition

Named Entity Linking (to Wikidata) and Disambiguation

Reproducibility (import/export)

Part-of-speech tagging

lexical diversity metrics

8 Lexical diversity metrics

Zotero import capibility

LanguageSpacy model in AVOBMATLemmatization (Spacy)Lemmatization (Lemmagen)Named entity recognitionNamed entity linking & disambiguationParts of speech tagging
SmallMediumLargeTransformer
Currently supported spaCy models
languages
Catalan
ChineseComing soon
Croatian
Danish
Dutch
English
Finnish
French
German
Greek
Italian
JapaneseComing soon
Korean
Lithuanian
Macedonian
Multilanguage
Norwegian
Polish
Portuguese
Romanian
Russian
Slovenian
Spanish
Swedish
Ukranian
Hungarian
SpaCy models to be added soon
Afrikaans
Albanian
Amharic
Ancient Greek
Arabic
Armenian
Azerbaijani
Basque
Bengali
Bulgarian
Czech
Estonian
Faroese
Gujarati
Hebrew
Hindi
Icelandic
Indonesian
Irish
Kannada
Kyrgyz
Latin
Latvian
Ligurian
Lower Sorbian
Luganda
Luxembourgish
Malay
Malayalam
Marathi
Nepali
Norwegian Nynorsk
Persian
Sanskrit
Serbian
Setswana
Sinhala
Slovak
Tagalog
Tamil
Tatar
Telugu
Thai
Tigrinya
Turkish
Upper Sorbian
Urdu
Vietnamese
Yoruba
Current Lemmagen
support
Slovenian
Serbian
ItalianComing soon
Romanian
Czech
Bulgarian
Estonian

*AVOBMAT can identify the language of texts in 52 languages before further processing.  Learn more>

AVOBMAT 2024©