This is a python package for Indic language (mostly sanskrit) metre identification and related tasks like syllabization. This module expects devanAgarI input, and currently produces IAST output.
- Autogenerated Docs on readthedocs (might be broken).
- Manually and periodically generated docs here
- For detailed examples and help, please see individual module files in this package.
sudo pip install chandas -U
sudo pip install git+https://github.com/sanskrit-coders/chandas/@master -U
- Web.
For more examples, see tests.
from chandas import identify
pattern_lines = identify.to_pattern_lines("निर्दिष्टाङ् कुलपतिना स पर्णशालाम् अध्यास्य प्रयतपरिग्रहद्वितीयः ।\nतच्छिष्याध्ययननिवेदितावसानां सव्ँविष्टः कुशशयने निशान् निनाय ॥".split("\n"))
id_result = identify.identifier.IdentifyFromPatternLines(pattern_lines)
assert id_result['exact'] == "Praharṣiṇī"
from chandas import syllabize
syllabize.get_syllables(u"ॐ मणि पद्मे ऽहम्") == "ओम् म णि पद् मे हम्"
syllabize.get_graphemes(u"बिक्रममेरोनामहो") == "बि क् र म मे रो ना म हो".split(" ")
Please feel free to use test data published here to test your own modules:
- We copy the data and identification code from shreevatsa's repo. This code has been transformed a bit to conform to PEP conventions.
- You may be interested in sharing and contributing to a common pool of test cases - see the chandas-id and syllabization tests under https://github.com/sanskrit-coders/chandas/tree/master/tests/data .
Have a problem or question? Please head to github.
- ~/.pypirc should have your pypi login credentials.
python setup.py bdist_wheel
twine upload dist/* --skip-existing
- sphinx html docs can be generated with
cd docs; make html
Run pytest
in the root directory.