Software
I’m an avid support of free open-source software (FOSS), (free as in free speech), and have published a lot thereof. Most of my software is hosted on GitHub and Sourcehut. I also maintain packages for Arch Linux, Debian and Alpine Linux. I am active in promoting good research software quality & sustainability. These latest years, I have a tendency to favour more minimalistic software, as there is a lot of bloated unmaintainable software and needless complexity around. One of the things of paramount importance for me, is that you are in control of your own software, and that it doesn't compromise your privacy or security. I wrote some posts on that subject in my blog section as well.
My main programming languages are Python, Rust, C, C++, shell scripting, and javascript.
Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an annotation.
Extensions for todo.txt: interactive rofi/fzf control, sync github issues, better colors, time tracking... and more!
Labirinto is a virtual laboratory portal, it makes a collection of software browseable and searchable for the end-user. Labirinto presents the software's metadata following the CodeMeta specification in an intuitive way and allows the user to filter and perform a limited search. The portal gives access to software if it offers web-based interfaces. This system is specifically geared towards research software, and for instance allows linking to relevant scientific publications for each tool.
LaMachine is a unified software distribution for Natural Language Processing. We integrate numerous open-source NLP tools, programming libraries, web-services, and web-applications in a single Virtual Research Environment that can be installed on a wide variety of machines.
FLAT is a web-based linguistic annotation environment based around the FoLiA format, a rich XML-based format for linguistic annotation. FLAT allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm. It is a document-centric tool that fully preserves and visualises document structure.
FoLiA is an XML-based annotation format, suitable for the representation of linguistically annotated language resources. FoLiA’s intended use is as a format for storing and/or exchanging language resources, including corpora. Our aim is to introduce a single rich format that can accommodate a wide variety of linguistic annotation types through a single generalised paradigm. We do not commit to any label set, language or linguistic theory.
Valkuil.net is een automatische spellingcorrector voor het Nederlands die zowel gewone typefouten als grammaticale fouten en verwarringen tussen bestaande woorden opspoort.