TY - GEN
T1 - Descriptive data analysis of file transfer data
AU - Srinivasan, Sudarshan
AU - Hazlewood, Victor
AU - Peterson, Gregory D.
PY - 2014
Y1 - 2014
N2 - There are millions of files and multi-terabytes of data transferred to and from the University of Tennessee's National Institute for Computational Sciences each month. New capabilities available with GridFTP version 5.2.2 include additional transfer log information previously unavailable in prior versions implemented within XSEDE. The transfer log data now available includes identification of source and destination endpoints which unlocks a wealth of information that can be used to detail GridFTP activities across the Internet. This information can be used for a wide variety of reports of interest to individual XSEDE Service Providers and to XSEDE Operations. In this paper, we discuss the new capabilities available for transfer logs in GridFTP 5.2.2, our initial attempt to organize, analyze, and report on this file transfer data for NICS, and its applicability to XSEDE Service Providers. Analysis of this new information can provide insight into effective and efficient utilization of GridFTP resources including identification of potential areas of GridFTP file transfer improvement (e.g., network and server tuning) and potential predictive analysis to improve efficiency.
AB - There are millions of files and multi-terabytes of data transferred to and from the University of Tennessee's National Institute for Computational Sciences each month. New capabilities available with GridFTP version 5.2.2 include additional transfer log information previously unavailable in prior versions implemented within XSEDE. The transfer log data now available includes identification of source and destination endpoints which unlocks a wealth of information that can be used to detail GridFTP activities across the Internet. This information can be used for a wide variety of reports of interest to individual XSEDE Service Providers and to XSEDE Operations. In this paper, we discuss the new capabilities available for transfer logs in GridFTP 5.2.2, our initial attempt to organize, analyze, and report on this file transfer data for NICS, and its applicability to XSEDE Service Providers. Analysis of this new information can provide insight into effective and efficient utilization of GridFTP resources including identification of potential areas of GridFTP file transfer improvement (e.g., network and server tuning) and potential predictive analysis to improve efficiency.
KW - Data analysis
KW - Database loading
KW - Log transfer
UR - http://www.scopus.com/inward/record.url?scp=84905499136&partnerID=8YFLogxK
U2 - 10.1145/2616498.2616550
DO - 10.1145/2616498.2616550
M3 - Conference contribution
AN - SCOPUS:84905499136
SN - 9781450328937
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the XSEDE 2014 Conference
PB - Association for Computing Machinery
T2 - 2014 Annual Conference on Extreme Science and Engineering Discovery Environment, XSEDE 2014
Y2 - 13 July 2014 through 18 July 2014
ER -