In the last couple of months, AVOBMAT has been presented and used at several events and university courses, including:
Print Culture and Public Spheres in Central Europe (1500–1800) COST Action Hackathon (University of Vienna)
"Multilingual Analysis and Visualization of Metadata and Texts in Central and Eastern European Literary Studies,” Digital Methods for a Comparative Study of Central European Literary History (University of Krakow)
Emigrants’ Heritage Workshop (Hungarian National Archives)
IT course (University of Warsaw)
We have preprocessed 5.4 billion tokens across multiple databases and made 32 databases publicly available.
Generative AI and LLMs are increasingly present in teaching and research, but LLM outputs can be difficult to reproduce without careful constraints and documentation. AVOBMAT is built for transparent, reproducible, corpus-scale analysis. It uses transformer language models to enrich texts and metadata, enabling interactive analysis across collections—so researchers and students can focus on critical interpretation and discovery.
What sets AVOBMAT apart for DH researchers, teachers, and libraries/GLAM:
Ready-to-use corpora for teaching: 32 public databases for seminar-based assignments
Explore and share your own collections: analyse your datasets and share private databases for collaboration
Scale: 5.4B tokens preprocessed across multiple databases
Reproducible workflows: explicit steps and documented parameters for verification and replication
Multilingual and accessible: 25 languages, no programming or costly hardware required
Extensible by design: modular architecture with potential future integration of task-specific open-weight LLMs
To unlock the full potential of your documents, submit an upload request with your preferred configuration settings. You don’t need any programming experience—just follow the upload steps in the Help menu or on GitHub. Sample databases and files are available in our GitHub repository.
Help us improve AVOBMAT. Please complete our 2–3 minute feedback form.
Interested in using AVOBMAT in a course, research project, or library/GLAM workflow? Please contact me to discuss a collaboration.
Thank you for your feedback.
We look forward to hearing from you.
Róbert Péter
on behalf of the AVOBMAT Team