Difference between revisions of "Language Modeling"

From K-Dutch ATO
Jump to navigation Jump to search
 
Line 5: Line 5:


==BERT-like models==
==BERT-like models==
* [https://huggingface.co/models?search=dutch Hugging Face Dutch Models]
* [https://people.cs.kuleuven.be/~pieter.delobelle/robbert/ RobBERT]: A Dutch RoBERTa-based Language Model
* [https://people.cs.kuleuven.be/~pieter.delobelle/robbert/ RobBERT]: A Dutch RoBERTa-based Language Model


==SpaCy==
==SpaCy==
* [https://spacy.io/models/nl Dutch models]
* [https://spacy.io/models/nl Dutch models]

Latest revision as of 15:28, 23 March 2021

n-gram modeling[edit | edit source]

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool colibri-patternmodeller which allows you to build, view, manipulate and query pattern models.

BERT-like models[edit | edit source]

SpaCy[edit | edit source]