Master of Science Online in Data Science: Curriculum
The Master of Science in Data Science program online begins with a foundational track required for students without appropriate math and/or programming experience. After completing core courses, students can choose from two specializations: AI & Machine Learning and Big Data Analytics.
Master of Science Online in Data Science: Curriculum
The Master of Science in Data Science program online begins with a foundational track required for students without appropriate math and/or programming experience. After completing core courses, students can choose from two specializations: AI & Machine Learning and Big Data Analytics.
DS 003 - Lorem ipsum dolor sit amet
Phasellus vel commodo urna. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per inceptos himenaeos. Phasellus egestas consequat dignissim. In sodales nisi vitae tortor vestibulum imperdiet. Aenean vehicula est vitae ante aliquam consequat. Cras eu odio et magna egestas finibus id nec orci.
DS 501 - Introduction to Data Science
Introduction to Data Science provides an overview of Data Science, covering a broad selection of key challenges in and methodologies for working with big data. Topics include: data collection, integration, management, modeling, analysis, visualization, prediction and informed decision-making, as well as data security and data privacy. This introductory course is integrative across the core disciplines of Data Science, including databases, data warehousing, statistics, data mining, data visualization, high-performance computing, cloud computing, and business intelligence. Professional skills, such as communication, presentation, and storytelling with data will be fostered.
Select 2 Categories from below.
AI and Machine Learning Focus Area
This category satisfies one of two requirements for this section.
CS 5007 - Introduction to Programming Concepts, Data Structures and Algorithms
This is an introductory graduate course teaching core computer science topics typically found in an undergraduate Computer Science curriculum, but at a graduate-level pace. It is primarily intended for students with little formal preparation in Computer Science to gain experience with fundamental Computer Science topics. After a review of programming concepts the focus of the course will be on data structures from the point of view of the operations performed upon the data and to apply analysis and design techniques to non-numeric algorithms that act on data structures. The data structures covered include lists, stacks, queues, trees and graphs. Projects will focus on the writing of programs to appropriately integrate data structures and algorithms for a variety of applications. This course may not be used to satisfy degree requirements for a B.S., M.S., or Ph.D. degree in Computer Science or a minor in Computer Science.
DS 502 - Statistical Methods for Data Science
This course surveys the statistical methods most useful in data science applications. Topics covered include predictive modeling methods, including multiple linear regression, and time series; data dimension reduction; Discrimination and classification methods, clustering methods; and committee methods. Students will implement these methods using statistical software. Prerequisites: Statistics at the level of MA 2611 and MA2612 and linear algebra at the level of MA 2071.
CS 548 - Knowledge Discovery and Data Mining
This course presents current research in Knowledge Discovery in Databases (KDD) dealing with data integration, mining, and interpretation of patterns in large collections of data. Topics include data warehousing and data preprocessing techniques; data mining techniques for classification, regression, clustering, deviation detection, and association analysis; and evaluation of patterns minded from data. Industrial and scientific applications are discussed. Recommended background: Background in artificial intelligence, databases, and statistics at the undergraduate level, or permission of the instructor. Proficiency in a high level programming language.
DS 517 - Math Foundations for DS
The foci of this class are the essential statistics and linear algebra skills required for Data Science students. The class builds the foundation for theoretical and computational abilities of the students to analyze high dimensional data sets. Topics covered include Bayes’ theorem, the central limit theorem, hypothesis testing, linear equations, linear transformations, matrix algebra, eigenvalues and eigenvectors, and sampling techniques, including Bootstrap and Markov chain Monte Carlo. Students will use these techniques while engaging in hands-on projects with real data. Prerequisites: Some knowledge of integral and differential calculus is recommended.
Select one of the courses from the courses listed below.
CS 534 - Artificial Intelligence
This course gives a broad survey of artificial intelligence. The course will cover methods from search, probabilistic reasoning, and learning, among other topics. Selected topics involving the applications of these tools are investigated. Such topics might include natural language understanding, scene understanding, game playing, and planning. (Prerequisites: familiarity with data structures and a high-level programming language.)
CS 542 - Database Management Systems
An introduction to the theory and design of database management systems. Topics covered include internals of database management systems, fundamental concepts in database theory, and database application design and development. In particular, logical design and conceptual modeling, physical d atabase design strategies, relational data model and query languages, query optimization, transaction management and distributed databases. Typically there are hands-on assignments and/or a course project. Selected topics from the current database research literature may be touched upon as well.
DS 598 - Data Science Capstone Experience
This 3-credit graduate qualifying project, typically done in teams, is to be carried out in cooperation with a sponsor or industrial partner. It must be overseen by a faculty member affiliated with the Data Science Program. This offering integrates theory and practice of Data Science, and should include the utilization of tools and techniques acquired in the Data Science Program. In addition to a written report, this project must be presented in a formal presentation to faculty of the Data Science program and sponsors. Professional development skills, such as communication, teamwork, leadership, and collaboration, along with storytelling, will be practiced. Prerequisite: DS 501, completion of at least 24 credits of the DS degree, or consent of the instructor.
CS 539 - Machine Learning
The focus of this course is machine learning for knowledge-based systems. It will include reviews of work on similarity-based learning (induction), explanation-based learning, analogical and case-based reasoning and learning, and knowledge compilation. It will also consider other approaches to automated knowledge acquisition as well as connectionist learning.
DS 503 - Big Data Management
Emerging applications in science and engineering disciplines generate and collect data at unprecedented speed, scale, and complexity that need to be managed and analyzed efficiently. This course introduces the emerging techniques and infrastructures developed for big data management including parallel and distributed database systems, map-reduce infrastructures, scalable platforms for complex data types, stream processing systems, and cloud-based computing. Query processing, optimization, access methods, storage layouts, and energy management techniques developed on these infrastructures will be covered. Students are expected to engage in hands-on projects using one or more of these technologies.
Select one of the courses from the courses listed below.
CS 573 - Data Visualization
This course exposes students to the field of data visualization, i.e., the graphical communication of data and information for the purposes of presentation, confirmation, and exploration. The course introduces the stages of the visualization pipeline. This includes data modeling, mapping data attributes to graphical attributes, visual display techniques, tools, paradigms, and perceptual issues. Students learn to evaluate the effectiveness of visualizations for specific data, task, and user types. Students implement visualization algorithms and undertake projects involving the use of commercial and public-domain visualization tools. Students also read papers from the current visualization literature and do classroom presentations.
MIS 584 - Business Intelligence
Today’s business computing infrastructures are producing the large volumes of data organizations need to make better plans and decisions. This course provides an introduction to the processes, technologies, and techniques for organizing, analyzing, visualizing, and interpreting data and information about business operations in a way that creates business value. During the course, students will study a variety of business decisions that can be improved by analyzing data about customers, sales, and operations, preparing students to be knowledgeable producers and consumers of business intelligence. Students will apply commercially available business intelligence software to develop performance dashboards to facilitate organizational decision-making. The course explores the technical challenges of organizing, analyzing, and presenting data and the managerial challenges of creating and deploying business intelligence expertise in organizations. The course includes business cases, in-class discussion, and hands-on analyses of business data. It is designed for any student interested in learning about data-driven business performance management and decision-making, including students whose primary focus is Data Science, IT, Marketing, Operations, or Business Management.
MKT 568 - Data Mining Business Applications
This course provides students with the key concepts and tools to turn raw data into useful business intelligence. A broad spectrum of business situations will be considered for which the tools of classical statistics and modern data mining have proven their usefulness. Problems considered will include such standard marketing research activities as customer segmentation and customer preference as well as more recent issues in credit scoring, churn management and fraud detection. Roughly half the class time will be devoted to discussions on business situations, data mining techniques, their application and their usage. The remaining time will comprise an applications laboratory in which these concepts and techniques are used and interpreted to solve realistic business problems. Some knowledge of basic marketing principles and basic data analysis is assumed.
DS 503 - Big Data Management
Emerging applications in science and engineering disciplines generate and collect data at unprecedented speed, scale, and complexity that need to be managed and analyzed efficiently. This course introduces the emerging techniques and infrastructures developed for big data management including parallel and distributed database systems, map-reduce infrastructures, scalable platforms for complex data types, stream processing systems, and cloud-based computing. Query processing, optimization, access methods, storage layouts, and energy management techniques developed on these infrastructures will be covered. Students are expected to engage in hands-on projects using one or more of these technologies.
MIS 587 - Business Applications in Machine Learning
This course explores how Machine Learning (ML) and Artificial Intelligence (AI) is applied to solve business problems, to satisfy specific business needs, or to discover new opportunities for businesses. Applications of ML and AI are constantly evolving across many industries. This course utilizes existing AutoML solutions to address issues identified in business case studies (e.g. predicting hospital readmissions, loans likely to default, customer churn). The course covers the machine learning project life cycle starting with defining ML project objectives, acquiring and exploring data, modeling using AutoML tools, interpretation of models and communication of outcomes, and implementation and deployment of predictive models in organizations.
Computing System Focus Area
This category satisfies one of two requirements for this section.
CS 502 - Operating Systems
The design and theory of multi-programmed operating systems, concurrent processes, process communication, input/output supervisors, memory management, resource allocation and scheduling are studied.
Select one of the courses from the courses listed below.
CS 513 - Computer Networks
This course provides an introduction to the theory and practice of the design of computer and communications networks, including the ISO seven-layer reference model. Analysis of network topologies and protocols, including performance analysis, is treated. Current network types including local area and wide area networks are introduced, as are evolving network technologies. The theory, design and performance of local area networks are emphasized. The course includes an introduction to queueing analysis and network programming.
CS 548 - Knowledge Discovery and Data Mining
This course presents current research in Knowledge Discovery in Databases (KDD) dealing with data integration, mining, and interpretation of patterns in large collections of data. Topics include data warehousing and data preprocessing techniques; data mining techniques for classification, regression, clustering, deviation detection, and association analysis; and evaluation of patterns minded from data. Industrial and scientific applications are discussed. Recommended background: Background in artificial intelligence, databases, and statistics at the undergraduate level, or permission of the instructor. Proficiency in a high level programming language.
CS 525 - ST: Cloud Computing
Modern data centers are massive warehouses that host hundreds of thousands of physical servers. Those large amount of interconnected servers provide the infrastructure foundations for today's evolving cloud computing platforms. Cloud computing, with its clear economic benefits and flexible resource offerings, has gained increasing popularity over the past decade. Today's cloud platforms host a plethora of services, including traditional web service, mobile backend, and big data analytic, and allow customers the freedom to deploy full stack applications in the cloud. In this course, we will discuss recent research on cloud computing and data centers, with the goals of better understanding and exploring the key challenges faced by the large scale data centers and cloud platforms. We will cover topics including but not limited to cloud computing services, cluster scheduling and provisioning, cloud-based machine learning, and interaction with mobile/edge computing. The course is structured with activities including reading, presentation, discussion, and programming to help students gain both conceptual understanding and practical experiences with cloud computing. For the final project, students will work on a research problem in cloud computing and deliver a research paper and a demo.
CS 539 - Machine Learning
The focus of this course is machine learning for knowledge-based systems. It will include reviews of work on similarity-based learning (induction), explanation-based learning, analogical and case-based reasoning and learning, and knowledge compilation. It will also consider other approaches to automated knowledge acquisition as well as connectionist learning. (Prerequisite: CS 534 or equivalent, or permission of the instructor.)
CS 577 - Advanced Computer and Communications Networks
This course covers advanced topics in the theory, design and performance of computer and communications networks. Topics will be selected from such areas as local area networks, metropolitan area networks, wide area networks, queueing models of networks, routing, flow control, new technologies and protocol standards. The current literature will be used to study new networks concepts and emerging technologies.
DS 504 - Big Data Analytics
Innovation and discoveries are no longer hindered by the ability to collect data, but the ability to summarize, analyze, and discover knowledge from the collected data in a scalable fashion. This course covers computational techniques and algorithms for analyzing and mining patterns in large-scale datasets. Techniques studied address data analysis issues related to data volume (scalable and distributed analysis), data velocity (high-speed data streams), data variety (complex, heterogeneous, or unstructured data), and data veracity (data uncertainty). Techniques include mining and machine learning techniques for complex data types, and scaleup and scale-out strategies that leverage big data infrastructures. Real-world applications using these techniques, for instance social media analysis and scientific data mining, are selectively discussed. Students are expected to engage in hands-on projects using one or more of these technologies. Prerequisites: A beginning course in databases and a beginning course in data mining, or equivalent knowledge, and programming experience.
This category satisfies one of two requirements for this section.
CS 571 - Case Studies in Computer Security
This course examines security challenges and failures holistically, taking into account technical concerns, human behavior, and business decisions. Using a series of detailed case studies, students will explore the interplay among these dimensions in creating secure computing systems and infrastructure. Students will also apply lessons from the case studies to emerging secure-systems design problems. The course requires active participation in class discussions, presentations, and writing assignments. It does not involve programming, but assumes that students have substantial prior experience with security protocols, attacks, and mitigations at the implementation level. This course satisfies the behavioral component of the MS specialization in computer security.
CS 557 - Software Security Design and Analysis
oftware is responsible for enforcing many central security goals in computer systems. These goals include authenticating users and other external principals, authorizing their actions, and ensuring the integrity and confidentiality of their data. This course studies how to design, implement, and analyze mechanisms to enforce these goals in both web systems and programs in traditional languages. Topics include: identifying programming choices that lead to reliable or flawed security outcomes, successful and unsuccessful strategies for incorporating cryptography into software, and analysis techniques that identify security vulnerabilities. The course will cover both practical and theoretical aspects of secure software, and will include a substantial secure software design project.
CS 558 - Computer Network Security
This course covers core security threats and mitigations at the network level. Topics include: denial-of-service, network capabilities, intrusion detection and prevention systems, worms, botnets, Web attacks, anonymity, honeypots, cybercrime (such as phishing), and legality and ethics. The course prepares students to think broadly and concretely about network security; it is not designed to teach students low-level tools for monitoring or maintaining system security. Assignments and projects will assess each student’s ability to think both conceptually and practically about network security.
MIS 584 - Business Intelligence
Today’s business computing infrastructures are producing the large volumes of data organizations need to make better plans and decisions. This course provides an introduction to the processes, technologies, and techniques for organizing, analyzing, visualizing, and interpreting data and information about business operations in a way that creates business value. During the course, students will study a variety of business decisions that can be improved by analyzing data about customers, sales, and operations, preparing students to be knowledgeable producers and consumers of business intelligence. Students will apply commercially available business intelligence software to develop performance dashboards to facilitate organizational decision-making. The course explores the technical challenges of organizing, analyzing, and presenting data and the managerial challenges of creating and deploying business intelligence expertise in organizations. The course includes business cases, in-class discussion, and hands-on analyses of business data. It is designed for any student interested in learning about data-driven business performance management and decision-making, including students whose primary focus is Data Science, IT, Marketing, Operations, or Business Management.
Business Intelligence Focus Area
This category satisfies one of two requirements for this section.
MIS 584 - Business Intelligence
Today’s business computing infrastructures are producing the large volumes of data organizations need to make better plans and decisions. This course provides an introduction to the processes, technologies, and techniques for organizing, analyzing, visualizing, and interpreting data and information about business operations in a way that creates business value. During the course, students will study a variety of business decisions that can be improved by analyzing data about customers, sales, and operations, preparing students to be knowledgeable producers and consumers of business intelligence. Students will apply commercially available business intelligence software to develop performance dashboards to facilitate organizational decision-making. The course explores the technical challenges of organizing, analyzing, and presenting data and the managerial challenges of creating and deploying business intelligence expertise in organizations. The course includes business cases, in-class discussion, and hands-on analyses of business data. It is designed for any student interested in learning about data-driven business performance management and decision-making, including students whose primary focus is Data Science, IT, Marketing, Operations, or Business Management.
MKT 568 - Data Mining Business Applications
This course provides students with the key concepts and tools to turn raw data into useful business intelligence. A broad spectrum of business situations will be considered for which the tools of classical statistics and modern data mining have proven their usefulness. Problems considered will include such standard marketing research activities as customer segmentation and customer preference as well as more recent issues in credit scoring, churn management and fraud detection. Roughly half the class time will be devoted to discussions on business situations, data mining techniques, their application and their usage. The remaining time will comprise an applications laboratory in which these concepts and techniques are used and interpreted to solve realistic business problems. Some knowledge of basic marketing principles and basic data analysis is assumed.