70
edits
No edit summary |
|||
| Line 18: | Line 18: | ||
#[https://wiki.mozilla.org/Support/Intern/2011/Brinda/Carrot2#Carrot2_with_most-voted_documents.2Fquestions Most Voted] | #[https://wiki.mozilla.org/Support/Intern/2011/Brinda/Carrot2#Carrot2_with_most-voted_documents.2Fquestions Most Voted] | ||
#[https://wiki.mozilla.org/Support/Intern/2011/Brinda/Carrot2#Carrot2_with_large_set_of_data.28_unknown_topic.29 Unknown topic]<br> | #[https://wiki.mozilla.org/Support/Intern/2011/Brinda/Carrot2#Carrot2_with_large_set_of_data.28_unknown_topic.29 Unknown topic]<br> | ||
Carrot2 can be used upto ~8500 documents. However as the number of documents increases, it becomes slower and takes more memory. Tuning a large set of document also becomes slow and takes around 5-10 mins for tuning a single attribute. Its imortant to note that Carrot2 will cluster documents and give you a better idea about what the issues are so that you can look out for questions of those kinds in the forum. However it does not provide a reliable number of documents with a specific problem as there is an overlapping of documents with different cluster names. You also need to manually separate documents into different clusters in the Other Topics especially with a large set fo documents. | Carrot2 can be used upto ~8500 documents. However as the number of documents increases, it becomes slower and takes more memory. Tuning a large set of document also becomes slow and takes around 5-10 mins for tuning a single attribute. Its imortant to note that Carrot2 will cluster documents and give you a better idea about what the issues are so that you can look out for questions of those kinds in the forum. However it does not provide a reliable number of documents with a specific problem as there is an overlapping of documents with different cluster names. You also need to manually separate documents into different clusters in the Other Topics especially with a large set fo documents. | ||
edits