Original Articles

Concentric layout, a new scientific data layout for matrix data-set in Hadoop file system

Pages 407-433 | Received 15 Mar 2012, Accepted 04 Aug 2012, Published online: 13 Sep 2012

