Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tokenize SD input? #8

Open
spyysalo opened this issue Oct 2, 2014 · 0 comments
Open

Tokenize SD input? #8

spyysalo opened this issue Oct 2, 2014 · 0 comments
Assignees

Comments

@spyysalo
Copy link
Owner

spyysalo commented Oct 2, 2014

The SD parser currently only separates tokens by whitespace, so that e.g. the last token of

~~~ sdparse
foo bar.
dep(foo, bar)
~~~

is bar., making the above break as the system can't find the token bar (without terminal dot). This appears to be a common source of error in manually entered SD analyses.

The possibility of doing e.g. PTB-like tokenization of input should be at least considered.

@spyysalo spyysalo self-assigned this Oct 2, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant