Constrained Coclustering for Textual Documents
| dc.contributor.author | Song, Yangqiu | |
| dc.contributor.author | Pan, Shimei | |
| dc.contributor.author | Liu, Shixia | |
| dc.contributor.author | Wei, Furu | |
| dc.contributor.author | Zhou, Michelle | |
| dc.contributor.author | Qian, Weihong | |
| dc.date.accessioned | 2025-06-05T14:03:45Z | |
| dc.date.available | 2025-06-05T14:03:45Z | |
| dc.date.issued | 2010-07-03 | |
| dc.description | Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2010 | |
| dc.description.abstract | In this paper, we present a constrained co-clustering approach for clustering textual documents. Our approach combines the benefits of information-theoretic co-clustering and constrained clustering. We use a two-sided hidden Markov random field (HMRF) to model both the document and word constraints. We also develop an alternating expectation maximization (EM) algorithm to optimize the constrained co-clustering model. We have conducted two sets of experiments on a benchmark data set: (1) using human-provided category labels to derive document and word constraints for semi-supervised document clustering, and (2) using automatically extracted named entities to derive document constraints for unsupervised document clustering. Compared to several representative constrained clustering and co-clustering approaches, our approach is shown to be more effective for high-dimensional, sparse text data. | |
| dc.description.uri | https://ojs.aaai.org/index.php/AAAI/article/view/7680 | |
| dc.format.extent | 6 pages | |
| dc.genre | conference papers and proceedings | |
| dc.genre | postprints | |
| dc.identifier | doi:10.13016/m2dv6n-2amh | |
| dc.identifier.citation | Song, Yangqiu, Shimei Pan, Shixia Liu, Furu Wei, Michelle Zhou, and Weihong Qian. “Constrained Coclustering for Textual Documents.” Proceedings of the AAAI Conference on Artificial Intelligence 24, no. 1 (July 3, 2010): 581–86. https://doi.org/10.1609/aaai.v24i1.7680. | |
| dc.identifier.uri | https://doi.org/10.1609/aaai.v24i1.7680 | |
| dc.identifier.uri | http://hdl.handle.net/11603/38750 | |
| dc.language.iso | en_US | |
| dc.publisher | AAAI | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Information Systems Department | |
| dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
| dc.subject | semi-supervised learning | |
| dc.title | Constrained Coclustering for Textual Documents | |
| dc.type | Text | |
| dcterms.creator | https://orcid.org/0000-0002-5989-8543 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- ConstrainedCoclusteringforTextualDocuments.pdf
- Size:
- 642.23 KB
- Format:
- Adobe Portable Document Format
