
Recent Developments in Document Clustering
(2007) Recent Developments in Document Clustering. Technical Report TR-07-35, Computer Science, Virginia Tech.
Abstract
This report gives a brief overview of the current state of document clustering research and presents recent developments in an organized manner. Clustering algorithms are considered with two hypothetical scenarios in mind: online query clustering under tight efficiency constraints, and offline clustering with an emphasis on accuracy. The algorithms are compared, with a table summarizing their important properties, and open problems and directions for future research are discussed.
Item Type: | Departmental Technical Report |
---|---|
Subjects: | Computer Science > Algorithms and Data Structure |
ID Code: | 1000 |
Deposited By: | Administrator, Eprints |
Deposited On: | 17 October 2007 |