Browsing by Subject "Amazon Web Services (Firm)"
Now showing 1 - 1 of 1
Results Per Page
Sort Options
Item Distributed Search Of Biological Databases Using Hadoop/Mapreduce(2015) Fashola, Babatunde Olaide; Stojkovic, Vojislav; Computer Science and Bioinformatics Program; Master of ScienceThe main goals of this thesis research were to: 1. Make a computational platform/environment for thesis research. 2. Develop a MapReduce search algorithm that employs the scalability of a Hadoop cluster and the MapReduce functionalities to make the search of a biological database faster. 3. Implement the MapReduce search algorithm using the Java programming language, and running the consequent Java application in a Hadoop multi-node cluster in the cloud. 4. Compare execution times of - The MapReduce search program - The serial search programs – Boyer-Moore Algorithm and Knuth-Morris-Pratt Algorithm 13 GB of downloadable GenBank data was processed over the Hadoop framework installed on a 12-node cluster comprised of the Amazon EC2 t2.micro instance types. The execution time of the distributed search program is 46% faster than the execution times of the serial programs. Hence, the present search algorithms used for accessing the biological databases can incorporate the MapReduce programming model to improve their performances