4.00 Credits
We are living in data-intensive world. Efficiently extracting, interpreting, and learning from very large datasets requires efficient and scalable algorithms as well as new data management technologies. Machine learning techniques and high performance computing make the efficient analysis of large volumes of data. In this course we explore big dataset analysis techniques and apply it to the distributed. This course is highly interactive. Students are expected to make use of technologies to design highly scalable systems that can process and analyze Big Data for a variety of scientific, social, and environmental challenges.
Prerequisite:
CMSC 310 (Grade of C or Higher) and MATH 217