User:Mitcho/ParserTNG: Difference between revisions

User:Mitcho/ParserTNG (view source)

Revision as of 01:13, 7 March 2009

147 bytes added , 7 March 2009

no edit summary

Mitcho

308

edits

@@ Line 4: / Line 4: @@
 ===High level overview:===
-. receive input
+# (split words/arguments)
-. (split words/arguments)
+# pick possible V's
-. pick possible V's
+# (pick possible clitics - for the (near) future)
-'. (pick possible clitics - for the (near) future)
+# group into arguments
-. group into arguments
+# noun type detection
-. noun type detection
+# rank
-. rank
 ===each language will have:===
@@ Line 18: / Line 17: @@
 <b>EX:</b> <code>add lunch with Dan tomorrow to my calendar</code>
-==step 1==
+==step 1: split words/arguments==
 Japanese: split on common particles... in the future get feedback from user for this
 Chinese: split on common functional verbs and prepositions
@@ Line 24: / Line 23: @@
 (Maybe split case marking prefixes/suffixes into individual words here?)
-==step 2==
+==step 2: pick possible V's==
 Ubiq will cache a regexp for detection of substrings of verb names. For example: <code>(a|ad|add|add-|...|add-to-calendar|g|go|...google...)</code>
@@ Line 33: / Line 32: @@
 <b>EX</b>: <code>('add','lunch with Dan tomorrow to my calendar'), ('','add lunch with Dan tomorrow to my calendar')</code>
-==step 3==
+==step 3: pick possible clitics==
+TODO
+==step 4: group into arguments==
 Find delimiters (see above).
@@ Line 64: / Line 67: @@
 (Note: for words which are not incorporated into an oblique argument (aka "modifier argument"), they are pushed onto the DO list.)
-step 4:
+==step 5: noun type detection==
 For each parse, send each argument string to the noun type detector. The noun type detector will cache detection results, so it only checks each string once. This returns a list of possible noun types with their "scores".
@@ Line 71: / Line 74: @@
 'my calendar' -> [{type: service, score: 1},{type: arb, score: .7}]
-step 5:
+==step 6: ranking==
+<code>
 foreach parse (w/o V)
    by semantic roles in the parse, find appropriate verbs
    foreach possible verb
      score = \prod_{each semantic role in the verb} score(the content of that argument being the appropriate nountype)
+</code>
-EX:
+<b>EX:</b>
 {V:    null,
@@ Line 103: / Line 109: @@
 score = score * (1-0.5**(#DO-1)) (example algorithm)
-EX: score = 1, with 2 direct objects, so
+<b>EX:</b> score = 1, with 2 direct objects, so
 score = 1 * (1-0.5**1) = 1 * 0.5 = 0.5

User:Mitcho/ParserTNG: Difference between revisions

User:Mitcho/ParserTNG (view source)

Revision as of 01:13, 7 March 2009

Navigation menu

Search