首站-论文投稿智能助手
典型文献
A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce
文献摘要:
Protein-protein interactions are of great significance for human to understand the functional mechanisms of proteins. With the rapid development of high-throughput genomic technologies, massive protein-protein interaction (PPI) data have been generated, making it very difficult to analyze them efficiently. To address this problem, this paper presents a distributed framework by reimplementing one of state-of-the-art algorithms, i.e., CoFex, using MapReduce. To do so, an in-depth analysis of its limitations is conducted from the perspectives of efficiency and memory consumption when applying it for large-scale PPI data analysis and prediction. Respective solutions are then devised to overcome these limitations. In particular, we adopt a novel tree-based data structure to reduce the heavy memory consumption caused by the huge sequence informationof proteins. After that, its procedure is modified by following the MapReduce framework to take the prediction task distributively. A series of extensive experiments have been conducted to evaluate the performance of our framework in terms of both efficiency and accuracy. Experimental results well demonstrate that the proposed framework can considerably improve its computational efficiency by more than two orders of magnitude while retaining the same high accuracy.
文献关键词:
作者姓名:
Lun Hu
作者机构:
School of Computer Science and Technology, Dongguan University of Technology, Dongguan 523808,China;Xinjiang Technical Institute of Physics and Chemistry,Chinese Academy of Sciences, Urumqi 830000, China;School of Computer Science and Technology,Wuhan University of Technology, Wuhan 430070, China;Chongqing Engineering Research Center of Big Data Application for Smart Cities, and Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China;Center of Research Excellence in Renewable Energy and Power Systems, and the Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi Arabia;Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, NJ 07102 USA
引用格式:
[1]Lun Hu-.A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce)[J].自动化学报(英文版),2022(01):160-172
A类:
reimplementing,CoFex,Respective,informationof
B类:
Distributed,Framework,Large,scale,Protein,Interaction,Data,Analysis,Prediction,Using,MapReduce,interactions,are,great,significance,human,understand,functional,mechanisms,proteins,With,rapid,development,high,throughput,genomic,technologies,massive,PPI,data,have,been,generated,making,very,difficult,analyze,them,efficiently,To,address,this,problem,paper,presents,distributed,framework,by,one,state,algorithms,using,depth,analysis,its,limitations,conducted,from,perspectives,efficiency,memory,consumption,when,applying,large,prediction,solutions,then,devised,overcome,these,particular,adopt,novel,tree,structure,reduce,heavy,caused,huge,sequence,After,that,procedure,modified,following,take,task,distributively,series,extensive,experiments,evaluate,performance,our,terms,both,accuracy,Experimental,results,well,demonstrate,proposed,considerably,improve,computational,more,than,two,orders,magnitude,while,retaining,same
AB值:
0.620426
相似文献
Toward High-Performance Delta-Based Iterative Processing with a Group-Based Approach
Hui Yu;Xin-Yu Jiang;Jin Zhao;Hao Qi;Yu Zhang;Xiao-Fei Lia;Hai-Kun Liu;Fu-Bing Mao;Hai Jin-National Engineering Research Center for Big Data Technology and System,Huazhong University of Science and Technology,Wuhan 430074,China;Service Computing Technology and System Laboratory,Huazhong University of Science and Technology Wuhan 430074,China;Cluster and Grid Computing Laboratory,Huazhong University of Science and Technology,Wuhan 430074,China;School of Computer Science and Technology,Huazhong University of Science and Technology,Wuhan 430074,China;School of Computer Science and Technology,HUST,Wuhan;School of Computer Science and Technology at HUST,Wuhan;School of Computer Science and Technology,Huazhong University of Science and Technology,Wuhan;Huazhong University of Science and Technology(HUST),Wuhan
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。