Skip to content

StarlangSoftware/NlpToolkit

Repository files navigation

NlpToolkit

OverView of Resources

Turkish Natural Language Resources in NlpToolkit

  1. Open, Extendible, and Fast Morphological Analyzer Dilbaz.
    • Over 60.000 root words.
    • FST and rule engine.
    • The only morphological analyzer backed by WordNet.
  2. Turkish WordNet: KeNet.
    • 80.000 synset.
    • 125.000 semantic relations including 25.000 interlingual relations.
  3. SentiNet: HisNet.
    • Parallel with KeNet.
    • 80.000 senti-synset.
  4. Turkish PropBank: TRopBank.
    • 18.000 verb argument relationship.
    • Parallel with KeNet.
  5. Turkish TreeBank.
    • 74.000 trees (27.000 syntactic 47.000 dependency trees).
    • Syntactically and semantically annotated
      • Morphological Analysis
      • Semantic Roles
      • Word Senses
      • Dependency Analysis

Overview of Library

36 NLP code libraries

Word Level Analysis

  1. Morphological Analysis Dilbaz v1.0.40 Javac++c#swiftpythoncythonjs
  2. Morphological Disambiguation v1.0.19 Javac++c#pythoncythonjs
  3. Turkish Dictionary v1.0.22 Javac++c#swiftpythoncythonjs
  4. Turkish WordNet KeNet v1.0.44 Javac++c#swiftpythoncythonjs
  5. Turkish SentiNet HisNet v1.0.7 Javac++c#swiftpythoncythonjs
  6. English PosTagger v1.0.3 Javac++c#swiftpythoncythonjs
  7. Turkish SpellChecker v1.0.26 Javac++c#swiftpythoncythonjs
  8. Turkish Deasciifier v1.0.24 Javac++c#swiftpythoncythonjs

Sentence Level Analysis

  1. Turkish PropBank TRopBank v1.0.11 Javac++c#swiftpythoncythonjs
  2. Dependency Parser v1.0.15 Javac++c#swiftpythoncythonjs
  3. Syntactic Parser v1.0.1 Java
  4. ParseTree v1.0.7 Javac++c#swiftpythoncythonjs
  5. AnnotatedSentence v1.0.31 Javac++c#swiftpythoncythonjs
  6. AnnotatedTree v1.0.19 Javac++c#pythoncythonjs
  7. Corpus v1.0.3 Javac++c#swiftpythoncythonjs
  8. NER Corpus Processing v1.0.6 Javac++c#swiftpythoncythonjs
  9. Turkish FrameNet v1.0.6 Javac++c#swiftpythoncythonjs
  10. Phrase to Dependency, Dependency to Phrase Structure Converter v1.0.1 Javac#

Preliminaries

  1. DataStructure v1.0.4 Javac++c#swiftpythoncythonjs
  2. Math v1.0.5 Javac++c#swiftpythoncythonjs
  3. Utils v1.0.3 Javac++c#swiftpythoncythonjs

Basic Algorithms

  1. Hmm v1.0.2 Javac++c#swiftpythoncythonjs
  2. NGram v1.0.19 Javac++c#swiftpythoncythonjs
  3. WordToVec v1.0.3 Javac++c#swiftpythoncythonjs

GUI

  1. DataCollector v1.0.19 Java

Machine Learning

  1. Sampling v1.0.1 Javac++c#swiftpythoncythonjs
  2. Classification v1.0.8 Javac++c#swiftpythoncythonjs
  3. DataGenerator v1.0.8 Javac++c#pythoncython
  4. Clustering v1.0.0 Java

High Level Analysis

  1. Semantic Role Labeling v1.0.2 Javac++c#pythoncythonjs
  2. Named Entity Recognition v1.0.3 Javac++c#swiftpythoncythonjs
  3. Word Sense Disambiguation v1.0.3 Javac++c#swiftpythoncythonjs
  4. Sentiment Analysis v1.0.2 Javac++c#
  5. Shallow Parsing v1.0.2 Java
  6. Dependency Parsing v.1.0.2 Java
  7. Machine Translation v1.0.3 Java

Best Fit for Natural Language Processing Teaching (NLP, Advanced NLP, Information Retrieval).

Contributors

  1. Linguistic Annotators
    • Undergraduate students from Işık University
    • Linguistics students from Boğaziçi University (Undergraduate, Graduate)
    • Starlang
  2. Computer Engineers
    • Işık University M.S. Students
    • Işık University Ph.D. Students

About

Natural Language Processing Library

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published