You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A closer look at the shabdArtha_kaustubha.babylon reveals a pattern: For every word, for every meaning of it, the पदविभागः tag shows up mandatorily, and other optional tags (like कन्नडार्थः, निष्पत्तिः etc) follow. Also, for every meaning, the word itself is first repeated, with a <br><br> delimiter. Example: the word अंश has 3 meanings (Inserting newlines for now, for readability):
Most entries follow this pattern. Hence it'd be a good idea to enforce it as a rule over those that don't as well. This would make automated parsing easier. Thus, we get:
This, together with typo-fixing, and also standardizing* the पदविभाग tag, has been already addressed in the first commit of this forked branch.
*By standardizing, we mean पदविभाग is ensured to be one of these:
"अव्ययम्",
"क्रियाविशेषणम्",
"त्रिलिङ्गः",
"धातुः",
"नपुंसकलिङ्गः",
"नामपदम्",
"पुल्लिङ्गः",
"पुल्लिङ्गः / नपुंसकलिङ्गः",
"पुल्लिङ्गः / स्त्रीलिङ्गः",
"विशेष्यनिघ्नम्",
"स्त्रीलिङ्गः",
"स्त्रीलिङ्गः / नपुंसकलिङ्गः"
Hesitating to pull-request due to having addressed multiple issues in one commit :(
A closer look at the
shabdArtha_kaustubha.babylon
reveals a pattern: For every word, for every meaning of it, the पदविभागः tag shows up mandatorily, and other optional tags (like कन्नडार्थः, निष्पत्तिः etc) follow. Also, for every meaning, the word itself is first repeated, with a<br><br>
delimiter. Example: the word अंश has 3 meanings (Inserting newlines for now, for readability):Most entries follow this pattern. Hence it'd be a good idea to enforce it as a rule over those that don't as well. This would make automated parsing easier. Thus, we get:
Before:
After:
The text was updated successfully, but these errors were encountered: