Support:Search v3: Difference between revisions

From MozillaWiki
Jump to navigation Jump to search
(Created page with '{{DRAFT}} This document refers to the search rewrite of Q4 2009. == Goals == * Improve index time (should be able to index at least once/hour). ** Move from Sphinx 0.9.8 to 0.…')
 
(added indices)
Line 28: Line 28:
  http://sumo-forumux.stage.mozilla.com/
  http://sumo-forumux.stage.mozilla.com/
The username/password is "support"/"stage".
The username/password is "support"/"stage".
== Indices ==
The current sphinx.conf provides the indices, with the indicate attributes:
=== forum_threads ===
Returns a '''threadId''' (for a thread)
{|
!attr name!!type!!description
|-
|forumid||int||
|-
|author_ord||int||crc32(userName)
|-
|status||int||crc32(status)
|-
|created||timestamp||
|-
|last_updated||timestamp||
|-
|replies||int||number of replies
|-
|authors||multi int||crc32(userName) for all posters in the thread
|}
=== forum_posts ===
Returns a '''threadId''' (for a specific post)
{|
!attr name!!type!!description
|-
|threadid||int||threadId of the actual thread (not the post)
|-
|author_ord||int||crc32(userName)
|-
|status ||int||crc32(status)
|-
|forumid||int||forum id for the thread
|-
|created||timestamp||
|}
=== wiki_pages ===
Returns a '''page_id'''
{|
!attr name!!type!!description
|-
|lastmodif||timestamp||
|-
|locale||int||crc32(lang)
|-
|tag||int multi||crc32(tag)
|-
|category||int multi||categId
|}

Revision as of 23:43, 11 November 2009

DRAFT
The content of this page is a work in progress intended for review.

Please help improve the draft!

Ask questions or make suggestions in the discussion
or add your suggestions directly to this page.


This document refers to the search rewrite of Q4 2009.

Goals

  • Improve index time (should be able to index at least once/hour).
    • Move from Sphinx 0.9.8 to 0.9.9.
    • Replace xmlpipe2 with direct SQL access.
  • Improve advanced search.
    • Implement new UI. [1]
    • Meaningful options for both knowledge base and forums.
  • Improve spelling suggestions.
    • Move from SOUNDEX to Aspell.
      • Dramatically improve suggestions.
      • Will hopefully improve performance.

Timeline

Search v3 is the primary goal of SUMO 1.5. Our current goal is to code freeze on November 24th and release on December 1st.

Code

The Search v3 branch is stored in

https://svn.mozilla.org/projects/sumo/branches/search

There is a dedicated staging server running Sphinx 0.9.9 at

http://sumo-forumux.stage.mozilla.com/

The username/password is "support"/"stage".

Indices

The current sphinx.conf provides the indices, with the indicate attributes:

forum_threads

Returns a threadId (for a thread)

attr name type description
forumid int
author_ord int crc32(userName)
status int crc32(status)
created timestamp
last_updated timestamp
replies int number of replies
authors multi int crc32(userName) for all posters in the thread

forum_posts

Returns a threadId (for a specific post)

attr name type description
threadid int threadId of the actual thread (not the post)
author_ord int crc32(userName)
status int crc32(status)
forumid int forum id for the thread
created timestamp

wiki_pages

Returns a page_id

attr name type description
lastmodif timestamp
locale int crc32(lang)
tag int multi crc32(tag)
category int multi categId