Cognitively Rich Framework to Automate Extraction and Representation of Legal Knowledge

Saha, Srishty; Joshi, Karuna P.

Cognitively Rich Framework to Automate Extraction and Representation of Legal Knowledge

dc.contributor.author	Saha, Srishty
dc.contributor.author	Joshi, Karuna P.
dc.date.accessioned	2019-03-08T16:40:17Z
dc.date.available	2019-03-08T16:40:17Z
dc.description.abstract	With the explosive growth in cloud based services, businesses are increasingly maintaining large datasets containing information about their consumers to provide a seamless user experience. To ensure privacy and secu- rity of these datasets, regulatory bodies have speci ed rules and compliance policies that must be adhered to by organizations. These regulatory policies are currently available as text documents that are not machine processable and so require extensive manual e ort to monitor them continuously to ensure data compliance. We have developed a cognitive framework to automatically parse and extract knowledge from legal documents and represent it using an Ontology. The framework captures knowledge in form of key terms, rules, topic summaries, relationships between various legal terms, semantically similar ter- minologies, deontic expressions and cross-referenced legal facts and rules. We built the framework using Deep Learning technologies like Tensor ow, for word embeddings and text summarization, Gensim for topic modeling and Se- man- tic Web technologies for building the knowledge graph. We have applied this framework to the United States government's Code of Federal Regulations (CFR) which includes facts and rules for individuals and organizations seek- ing to do business with the US Federal government. In this paper we describe our framework in detail and present results of the CFR legal knowledge base that we have built using this framework. Our framework can be adopted by businesses to build their automated compliance monitoring system.	en
dc.description.sponsorship	This research was partially supported by a DoD supplement to the NSF award 1439663 : NSF I/UCRC Center for Hybrid Multicore Productivity Research (CHMPR).	en
dc.format.extent	18 pages	en
dc.genre	journal articles preprints	en
dc.identifier	doi:10.13016/m21nkk-ppkc
dc.identifier.uri	http://hdl.handle.net/11603/12996
dc.language.iso	en	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Information Systems Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject	deep learning	en
dc.subject	legal text analytics	en
dc.subject	compliance	en
dc.subject	semantic web	en
dc.title	Cognitively Rich Framework to Automate Extraction and Representation of Legal Knowledge	en
dc.type	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Karuna_shrishty-LegalPaper.pdf
Size:: 750.54 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Information Systems Department
UMBC Faculty Collection