Search
Now showing items 1-10 of 118
HLTCOE Approaches to Knowledge Base Population at TAC 2009
(National Institute of Standards and Technology, 2009-11-01)
The HLTCOE participated in the entity linking and slot filling tasks at TAC 2009. A machine learning-based approach to entity linking, operating over a wide range of feature types, yielded good performance on the entity ...
Improving Binary Classification on Text Problems using Differential Word Features
(ACM, 2009-11-02)
We describe an efficient technique to weigh word-based features in binary classification tasks and show that it significantly improves classification accuracy on a range of problems. The most common text classification ...
Ensembles in Adversarial Classification for Spam
(ACM, 2009-11-02)
The standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performance of such classifiers tends to decrease ...
Video Summarization of Laparoscopic Cholecystectomies
(2009-11-14)
We compared image features with a distance metric and support vector machine to identify the critical view of a laparoscopic cholecystectomy. Our accuracy was up to 91%. We are currently experimenting with particle analysis, ...
Policy-based Malicious Peer Detection in Ad Hoc Networks
(IEEE, 2009-08-29)
Mobile Ad hoc Networks (MANETs) are susceptible to various node misbehaviors due to their unique features, such as highly dynamic network topology, rigorous power constraints and error-prone transmission media. Significant ...
Delta TFIDF: An Improved Feature Space for Sentiment Analysis
(AAAI, 2009-05-17)
Mining opinions and sentiment from social networking sites is a popular application for social media systems. Common approaches use a machine learning system with a bag of words feature set. We present Delta TFIDF, an ...
Wikipedia as an Ontology for Describing Documents
(AAAI, 2008-03-31)
Identifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used to model a person's current interests for ...
Second Space: A Generative Model For The Blogosphere
(AAAI, 2008-03-31)
Web graphs have been very useful in the structural and statistical analysis of the web. Various models have been proposed to simulate web graphs that generate degree distributions similar to the web. Real world blog networks ...
Approximating the Community Structure of the Long Tail
(AAAI, 2008-03-31)
In many social media applications, a small fraction of the members are highly linked while most are sparsely connected to the network. Such a skewed distribution is sometimes referred to as the "long tail". Popular ...
Enforcing security in semantics driven policy based networks
(IEEE, 2008-04-12)
Security is emerging as an important requirement for a number of distributed applications such as online banking, social networking etc. due to the private nature of the data being involved. Further more, the wide spread ...