Linguistics

81 resources, 1 category

Uncategorized (81 items)

/r/LanguageTechnology/

15 most popular books on Goodreads

Araneum Germanicum

Awesome Community-Curated NLP List

awesome Information Retrieval

awesome-chinese-nlp

awesome-danish

awesome-hungarian-nlp

awesome-nlp

awesome-nlp-polish

awesome-spanish-nlp

Bag of words model

C-WEP

CEHugeWebCorpus

German corpus based on CommonCrawl

CLARIN-D web tools

Tools for Analysing Research Data

Computational Linguistics Lecture Playlist (You...

Lectures for University of Maryland class on computational linguistics.

corpus-linguistics

GitHub topics &

CorpusExplorer

Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 50 interactive visualizations under a user-friendly interface.

dbmdz BERT models

Deepset German BERT model

Digitales Wörterbuch der deutschen Sprache (DWDS)

Document classification

DTA Basisformat

DysList (list of dyslexic errors)

Essentials of Linguistics, 2nd edition

An introductory book (2nd edition).

EuroRomCom Data

JSON formatted Pan-Romance word lists.

Evaluating German Transformer Language Models w...

Falko

Foundations of Computational Linguistics

Foundations of Statistical Natural Language Pro...

GC4 Corpus

(CommonCrawl)

German ELMo Model

German NLP resources

german-transformer-training

GermLM

(NER exploration)

GerPT2

Haxe-linguistics

Early linguistic analysis and natural language processing library for Haxe.

IDS Corpora

German Reference Corpus

Indonesian NLP

Introduction to Linguistics

ISO TC 37 SC 4

Language models

Language Science Press

Language Science Press is a born-digital scholar-led open access publisher in linguistics.

Leipzig Corpora Collection

Sampled sentences in different languages.

Linguistics Stack Exchange

Litkey

Low Resource Languages

A list of resources for conservation, development, and documentation of low resource (human) languages.

M. Weisser's list of NLP/Computational Linguist...

Mate Tools

Web service via WebLicht

Naive Bayes classification

Natural

General natural language tools for Node.js.

Natural language processing

Natural Language Processing with Python

The book from the NLTK package.

Natural Language ToolKit (NLTK)

The most complete platform for building Python programs to work with human language data.

nlp

GitHub topics &

nlp-datasets

NLP-progress

Norwegian NLP resources

OpinionSpam

Outline of natural language processing

Parts of speech tagging

SdeWaC

Big German internet corpus

Semisupervised Learning for Computational Lingu...

Sentence Transformers

Sentiment analysis

Snowball

Snowball is a language in which stemming algorithms can be easily represented.

spaCy

Industrial-strength Natural Language Processing in Python.

Speech and Language Processing: An Introduction...

Stemming algorithms for various European languages

Various stemming algorithms from Snowball.

Term frequency - inverse document frequency

Text Mining with R

textblob-de

A nice alternative to spaCy (see above).

The Oxford Handbook of Computational Linguistics

The Porter Stemmer Algorithm

The ‘official’ home page for distribution of the Porter Stemming Algorithm, written and maintained by its author, Martin Porter.

The Virtual Linguistics Campus

CC-licensed educational videos interconnected with Marburg University's e-learning platform of the same name.

tyo

A utility for finding Typo-Bridges.

UBIAI

Easy-to-use text annotation tool for teams, with comprehensive auto-annotation features. Supports NER, relations, and document classification, as well as OCR annotation for invoice labeling.

UIMA

Untranslatable.co, Multilingual urban dictionary

UralicNLP

An open source Python library for processing morphologically rich and, for the most part, endangered Uralic languages. It can do morphological analysis, generation, lemmatization, disambiguation and lexical lookup for a great many Uralic languages.

Vector space model
