Repository of the dataset and code published with the paper titled "Taxonomy of Mathematical Plagiarism" at ECIR 2024

gipplab/Taxonomy-of-Mathematical-Plagiarism

Taxonomy of Mathematical Plagiarism

This repository provides the dataset published with the paper "Taxonomy of Mathematical Plagiarism" and the code used in its experiments.

Dataset

We curated a dataset of pairs of potentially plagiarised mathematical content spans from documents, each annotated with its obfuscation type (the way in which the content was modified). The dataset and a description of the accompanying files are available in data/.

Experiments

We analyzed how the best-performing approaches for plagiarism detection and mathematical content similarity perform on the newly established taxonomy. The corresponding code is in code/experiments/.

Paper

A. Satpute, A. Greiner-Petter, N. Giessing, I. Beckenbach, M. Schubotz, O. Teschke, A. Aizawa, and B. Gipp, “Taxonomy of Mathematical Plagiarism,” in 46th European Conference on Information Retrieval (ECIR), Glasgow, Scotland, 2024.

License

CC-BY-SA 4.0. This defines the license for the whole dataset, which contains non-copyrighted bibliographic metadata and reference data derived from I4OSC (CC0).
