TY - GEN
T1 - Design principles for effective knowledge discovery from big data
AU - Begoli, Edmon
AU - Horey, James
PY - 2012
Y1 - 2012
N2 - Big data phenomenon refers to the practice of collection and processing of very large data sets and associated systems and algorithms used to analyze these massive datasets. Architectures for big data usually range across multiple machines and clusters, and they commonly consist of multiple special purpose sub-systems. Coupled with the knowledge discovery process, big data movement offers many unique opportunities for organizations to benefit (with respect to new insights, business optimizations, etc.). However, due to the difficulty of analyzing such large datasets, big data presents unique systems engineering and architectural challenges. In this paper, we present three system design principles that can inform organizations on effective analytic and data collection processes, system organization, and data dissemination practices. The principles presented derive from our own research and development experiences with big data problems from various federal agencies, and we illustrate each principle with our own experiences and recommendations.
AB - Big data phenomenon refers to the practice of collection and processing of very large data sets and associated systems and algorithms used to analyze these massive datasets. Architectures for big data usually range across multiple machines and clusters, and they commonly consist of multiple special purpose sub-systems. Coupled with the knowledge discovery process, big data movement offers many unique opportunities for organizations to benefit (with respect to new insights, business optimizations, etc.). However, due to the difficulty of analyzing such large datasets, big data presents unique systems engineering and architectural challenges. In this paper, we present three system design principles that can inform organizations on effective analytic and data collection processes, system organization, and data dissemination practices. The principles presented derive from our own research and development experiences with big data problems from various federal agencies, and we illustrate each principle with our own experiences and recommendations.
KW - Big Data
KW - architecture
KW - design principles
UR - http://www.scopus.com/inward/record.url?scp=84870689823&partnerID=8YFLogxK
U2 - 10.1109/WICSA-ECSA.212.32
DO - 10.1109/WICSA-ECSA.212.32
M3 - Conference contribution
AN - SCOPUS:84870689823
SN - 9780769548272
T3 - Proceedings of the 2012 Joint Working Conference on Software Architecture and 6th European Conference on Software Architecture, WICSA/ECSA 2012
SP - 215
EP - 218
BT - Proceedings of the 2012 Joint Working Conference on Software Architecture and 6th European Conference on Software Architecture, WICSA/ECSA 2012
T2 - Joint 10th Working IEEE/IFIP Conference on Software Architecture, WICSA 2012 and 6th European Conference on Software, ECSA 2012
Y2 - 20 August 2012 through 24 August 2012
ER -