Recent Developments in Document Clustering

Andrews, Nicholas O. and Fox, Edward A. (2007) Recent Developments in Document Clustering. Technical Report TR-07-35, Computer Science, Virginia Tech.


This report gives a brief overview of the current state of document clustering research and presents recent developments in an organized manner. Clustering algorithms are considered with two hypothetical scenarios in mind: online query clustering with tight efficiency constraints, and offline clustering with an emphasis on accuracy. The algorithms are compared, with a table summarizing their important properties, and open problems and directions for future research are discussed.

Item Type: Departmental Technical Report
Subjects: Computer Science > Algorithms and Data Structure
ID Code: 1000
Deposited By: Administrator, Eprints
Deposited On: 17 October 2007