TY - GEN
T1 - SSD-optimized workload placement with adaptive learning and classification in HPC environments
AU - Wan, Lipeng
AU - Lu, Zheng
AU - Cao, Qing
AU - Wang, Feiyi
AU - Oral, Sarp
AU - Settlemyer, Bradley
PY - 2014
Y1 - 2014
N2 - In recent years, non-volatile memory devices such as SSD drives have emerged as a viable storage solution due to their increasing capacity and decreasing cost. Due to the unique capability and capacity requirements in large scale HPC (High Performance Computing) storage environment, a hybrid configuration (SSD and HDD) may represent one of the most available and balanced solutions considering the cost and performance. Under this setting, effective data placement as well as movement with controlled overhead become a pressing challenge. In this paper, we propose an integrated object placement and movement framework and adaptive learning algorithms to address these issues. Specifically, we present a method that shuffle data objects across storage tiers to optimize the data access performance. The method also integrates an adaptive learning algorithm where realtime classification is employed to predict the popularity of data object accesses, so that they can be placed on, or migrate between SSD or HDD drives in the most efficient manner. We discuss preliminary results based on this approach using a simulator we developed to show that the proposed methods can dynamically adapt storage placements and access pattern as workloads evolve to achieve the best system level performance such as throughput.
AB - In recent years, non-volatile memory devices such as SSD drives have emerged as a viable storage solution due to their increasing capacity and decreasing cost. Due to the unique capability and capacity requirements in large scale HPC (High Performance Computing) storage environment, a hybrid configuration (SSD and HDD) may represent one of the most available and balanced solutions considering the cost and performance. Under this setting, effective data placement as well as movement with controlled overhead become a pressing challenge. In this paper, we propose an integrated object placement and movement framework and adaptive learning algorithms to address these issues. Specifically, we present a method that shuffle data objects across storage tiers to optimize the data access performance. The method also integrates an adaptive learning algorithm where realtime classification is employed to predict the popularity of data object accesses, so that they can be placed on, or migrate between SSD or HDD drives in the most efficient manner. We discuss preliminary results based on this approach using a simulator we developed to show that the proposed methods can dynamically adapt storage placements and access pattern as workloads evolve to achieve the best system level performance such as throughput.
UR - http://www.scopus.com/inward/record.url?scp=84905455073&partnerID=8YFLogxK
U2 - 10.1109/MSST.2014.6855552
DO - 10.1109/MSST.2014.6855552
M3 - Conference contribution
AN - SCOPUS:84905455073
SN - 9781479956715
T3 - IEEE Symposium on Mass Storage Systems and Technologies
BT - 2014 30th Symposium on Mass Storage Systems and Technologies, MSST 2014
PB - IEEE Computer Society
T2 - 30th Symposium on Massive Storage Systems and Technologies, MSST 2014
Y2 - 2 June 2014 through 6 June 2014
ER -