Labs/Ubiquity/Parser 2: Difference between revisions

Line 34: Line 34:


=step 1: split words/arguments + case markers=
=step 1: split words/arguments + case markers=
{{Labs/Ubiquity/Template:Infobox parser step
|input = input argument (<code>Parser.Query.input</code>)
|output = updated input string <code>Parser.Query._input</code>}}
Step 1 doesn't actually split the words up into an array, but it does insert whitespace and no-width whitespace between characters in the input to facilitate future parsing.
Japanese: split on common particles... in the future get feedback from user for this
Japanese: split on common particles... in the future get feedback from user for this


Chinese: split on common functional verbs and prepositions
Chinese: split on common functional verbs and prepositions
strongly case marking languages: split off case affixes


=step 2: pick possible Verbs =
=step 2: pick possible Verbs =
308

edits