Design of MAHA Supercomputing System for Human Genome Analysis


KIPS Transactions on Software and Data Engineering, Vol. 2, No. 2, pp. 81-90, Feb. 2013
10.3745/KTSDE.2013.2.2.81,   PDF Download:

Abstract

During the past decade, many changes and attempts have been tried and are continued developing new technologies in the computing area. The brick wall in computing area, especially power wall, changes computing paradigm from computing hard wares including processor and system architecture to programming environment and application usage. The high performance computing (HPC) area, especially, has been experienced catastrophic changes, and it is now considered as a key to the national competitiveness. In the late 2000`s, many leading countries rushed to develop Exascale supercomputing systems, and as a results tens of PetaFLOPS system are prevalent now. In Korea, ICT is well developed and Korea is considered as a one of leading countries in the world, but not for supercomputing area. In this paper, we describe architecture design of MAHA supercomputing system which is aimed to develop 300 TeraFLOPS system for bio-informatics applications like human genome analysis and protein-protein docking. MAHA supercomputing system is consists of four major parts-computing hardware, file system, system software and bio-applications. MAHA supercomputing system is designed to utilize heterogeneous computing accelerators (co-processors like GPGPUs and MICs) to get more performance/$, performance/area, and performance/power. To provide high speed data movement and large capacity, MAHA file system is designed to have asymmetric cluster architecture, and consists of metadata server, data server, and client file system on top of SSD and MAID storage servers. MAHA system soft wares are designed to provide user-friendliness and easy-to-use based on integrated system management component - like Bio Workflow management, Integrated Cluster management and Heterogeneous Resource management. MAHA supercomputing system was first installed in Dec., 2011. The theoretical performance of MAHA system was 50 TeraFLOPS and measured performance of 30.3 TeraFLOPS with 32 computing nodes. MAHA system will be upgraded to have 100 TeraFLOPS performance at Jan., 2013.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
Y. W. Kim, H. Y. Kim, S. J. Bae, H. Y. Kim, Y. C. Woo, S. J. Park, W. Choi, "Design of MAHA Supercomputing System for Human Genome Analysis," KIPS Transactions on Software and Data Engineering, vol. 2, no. 2, pp. 81-90, 2013. DOI: 10.3745/KTSDE.2013.2.2.81.

[ACM Style]
Young Woo Kim, Hong Yeon Kim, Seung Jo Bae, Hag Young Kim, Young Choon Woo, Soo Jun Park, and Wan Choi. 2013. Design of MAHA Supercomputing System for Human Genome Analysis. KIPS Transactions on Software and Data Engineering, 2, 2, (2013), 81-90. DOI: 10.3745/KTSDE.2013.2.2.81.