3.00 Credits
This course covers advanced study and practice in using a modern scripting language to integrate off-the-shelf code libraries for the retrieval of unstructured and partially structured data, and for the cleaning, integration, formatting, storage, analysis, and visualization of large data sets. Modern scripting languages include powerful built-in features for storing, retrieving, mapping, and integrating data; code libraries extend such features greatly. Libraries include those for regular-expression-based extraction of textual data, data integration, statistical analysis and correlation, machine learning, natural language processing, machine vision and listening, visualization, and storage in files and database systems. Emphasis is on using a scripting language to glue together off-the-shelf library modules without writing the complex underlying library code.