Open access

Dimensioning storage and computing clusters for efficient high throughput computing


Published under licence by IOP Publishing Ltd
Citation: E Accion et al 2012 J. Phys.: Conf. Ser. 396 042040. DOI: 10.1088/1742-6596/396/4/042040


Abstract

Scientific experiments are producing huge amounts of data, and the size of their datasets and the total volume of data continue to increase. These data are then processed by researchers belonging to large scientific collaborations, with the Large Hadron Collider being a good example. The focal point of scientific data centers has shifted from efficiently coping with PetaByte-scale storage to delivering high-quality data-processing throughput. The dimensioning of the internal components of High Throughput Computing (HTC) data centers is of crucial importance to cope with all the activities demanded by the experiments, both online (data acceptance) and offline (data processing, simulation and user analysis). This requires a precise setup involving disk and tape storage services, a computing cluster and the internal networking, in order to prevent the bottlenecks, overloads and undesired slowness that lead to lost CPU cycles and batch job failures. In this paper we point out relevant features for running a successful data storage and processing service in an intensive HTC environment.
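The abstract's central point is that job slots, storage services and internal networking must be dimensioned together so that no single component becomes the bottleneck. As a purely illustrative sketch, not taken from the paper, the following Python snippet performs the kind of back-of-envelope check this implies: it compares the aggregate I/O demanded by a batch farm against the deliverable bandwidth of the disk-server layer. All names and figures (worker node counts, per-job I/O rates, link speeds) are hypothetical placeholders.

```python
# Back-of-envelope dimensioning check for an HTC batch farm.
# All figures below are hypothetical placeholders, not values from the paper.

from dataclasses import dataclass


@dataclass
class FarmConfig:
    worker_nodes: int          # number of worker nodes in the batch farm
    slots_per_node: int        # concurrent job slots per worker node
    io_per_job_mbps: float     # average sustained read rate per job (MB/s)
    disk_servers: int          # number of disk (storage) servers
    server_net_gbps: float     # network link speed per disk server (Gb/s)
    server_disk_mbps: float    # sustained disk throughput per server (MB/s)


def aggregate_demand_mbps(cfg: FarmConfig) -> float:
    """Total read bandwidth requested by all running jobs, in MB/s."""
    return cfg.worker_nodes * cfg.slots_per_node * cfg.io_per_job_mbps


def storage_capacity_mbps(cfg: FarmConfig) -> float:
    """Deliverable bandwidth of the storage layer: each server is limited
    by whichever is smaller, its network link or its disk throughput."""
    per_server_net_mbps = cfg.server_net_gbps * 1000 / 8  # Gb/s -> MB/s
    per_server = min(per_server_net_mbps, cfg.server_disk_mbps)
    return cfg.disk_servers * per_server


if __name__ == "__main__":
    cfg = FarmConfig(
        worker_nodes=400,
        slots_per_node=8,
        io_per_job_mbps=2.5,
        disk_servers=60,
        server_net_gbps=10.0,
        server_disk_mbps=800.0,
    )
    demand = aggregate_demand_mbps(cfg)
    capacity = storage_capacity_mbps(cfg)
    print(f"Demand:   {demand:8.0f} MB/s")
    print(f"Capacity: {capacity:8.0f} MB/s")
    if demand > capacity:
        print("Storage is the bottleneck: add servers or limit concurrent jobs.")
    else:
        print(f"Headroom: {capacity - demand:.0f} MB/s")
```

In practice such an estimate would be refined with measured per-job I/O profiles and would also cover the tape system and the farm-to-storage network fabric, but the same comparison of demanded versus deliverable throughput underlies each check.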

