Generalized Vector Space Model using Karl Pearson correlation coefficients
-
Updated
Dec 5, 2016 - Python
Generalized Vector Space Model using Karl Pearson correlation coefficients
Designed a scalable and efficient search engine in Python to query a Wikipedia corpus of ~75GB with a response time of 1s and outputs the top 10 relevant documents based on the search query.
Web Retrieval and Mining 2020Spring
Information Retrieval, Natural Language Processing, Machine Learning
IR and text mining project to calculate candidate's job profile score based on factors such as Education, Discipline, Required Skills, Desired Skills, and Years of Experience. Implemented Inverted Index algorithm for job filtering and Vector Space Model algorithm for ranking the documents (Jobs).
Vector space model for information retrieval
Hello folks! Looking for a fully modular, open source, Pygame 3d-Engine concept? Well this may be a good start. GridMod is capable of visualising self made three dimensional shapes, by groupping nodes, vectors and matrices, and by applying common matrix operations to those vertices we get to display those predefined 3d objects, along with a scal…
Knowledge processing technologies : Information Retrieval and text classification
Information Retrieval System
Documents and queries are represented as vectors. Each dimension corresponds to a separate term. If a term occurs in the document, its value in the vector is non-zero. Several different ways of computing these values, also known as (term) weights, have been developed. One of the best known schemes is tf-idf weighting (see the example below). The…
Using Apache Lucene to index documents in AP89 corpus, perform retrieval on TREC topics and evaluate the performance of retrieval algorithms using different evaluation metrics
Domain specific information retrieval system based on boolean retrieval and vector space models
Search engine based on the vector space model
Implementation of a vector space-based information retrieval system.
A co-authored project (lead author & researcher: Karlina Denistia, Ph.D.) on a corpus-based, distributional-semantic study of Indonesian Classifier. This repository tracks the versioning of the R codes for cosine similarity analyses performed by @gederajeg. See the OSF link below for the complete set of materials.
Vector Space based End-to-End Web Retrieval/ Search Engine with self-contained units
Mathematical tools to ease applied Maths and Physics in JavaScript
A basic and intuitive Python module for (Vector Space) IR system. (Focuses on simplicity and understandability)
Add a description, image, and links to the vector-space-model topic page so that developers can more easily learn about it.
To associate your repository with the vector-space-model topic, visit your repo's landing page and select "manage topics."