Degree Offered
- Bachelor of Science
Nature of the Program
Data science is an interdisciplinary field with roots in applied mathematics, statistics and computer science. The Bachelor of Science in Data Science seeks to meet the increased employment demands across many industries and research fields.
Data Science majors will develop quantitative and computational skills to solve real-world problems. For example, data scientists are responsible for creating and maintaining dashboards in a pandemic, predicting traffic patterns to improve driver safety and helping apps like Uber Eats optimize food delivery. Students can customize the degree to fit their interests by selecting a focus area of their choice to create a degree with practical applications.
Working with their academic advisers, data science students will take classes in a discipline related to their interests and career goals. Students can choose their area of emphasis among a variety of areas including the social sciences, humanities, and sciences. Examples, include astronomy, biology, criminology, geography, geology, GIS, physics, public health, psychology, and sociology.
Faculty
Director of the School of Mathematical and Data Sciences
- Earl Scmie - Ph.D. University of Wisconsin, Madison
Areas: fusion energy, space plasma physics, industrial plasma physics, plasma diagnostics, neurosciences imaging, magnetic reconnection, robotics, STEM education
Professor
- Snehalata Huzurbazar - Ph.D. (Colorado State University, Fort Collins)
Areas: Statistics, Data Sciences
Assistant professor
- Srinjoy Das - Ph.D. (University of California San Diego)
Areas: Data Sciences
Admissions
- First Time Freshmen are admitted to the major directly. For the timely completion of the degree, it is recommended that students have a minimum MATH ACT of 22, a MATH SAT of 540, or an ALEKS score of 45. Students must have a placement into a Math course to receive a complete schedule for their first semester. Students without placement may not be competitive to remain in the major
- Students transferring from another WVU major with fewer than 29 credits must have completed MATH 126 with a grade of C- or higher; students who have completed 30 or more credits must have completed MATH 154 or MATH 155 with C- or higher and have earned a 2.0 overall GPA.
- Students transferring from another institution with fewer than 29 credits must have completed MATH 126 with a grade of C- or higher; students who have completed 30 or more credits must have completed MATH 154 or MATH 155 with C- or higher and have earned a 2.0 overall GPA.
Admissions Requirements for 2023-2024
The Admissions Requirements above will be the same for the 2023-2024 Academic Year.
Major Code: 14E7
General Education Foundations
Please use this link to view a list of courses that meet each GEF requirement.
NOTE: Some major requirements will fulfill specific GEF requirements. Please see the curriculum requirements listed below for details on which GEFs you will need to select.
General Education Foundations | ||
F1 - Composition & Rhetoric | 3-6 | |
Introduction to Composition and Rhetoric and Composition, Rhetoric, and Research | ||
or ENGL 103 | Accelerated Academic Writing | |
F2A/F2B - Science & Technology | 4-6 | |
F3 - Math & Quantitative Reasoning | 3-4 | |
F4 - Society & Connections | 3 | |
F5 - Human Inquiry & the Past | 3 | |
F6 - The Arts & Creativity | 3 | |
F7 - Global Studies & Diversity | 3 | |
F8 - Focus (may be satisfied by completion of a minor, double major, or dual degree) | 9 | |
Total Hours | 31-37 |
Please note that not all of the GEF courses are offered at all campuses. Students should consult with their advisor or academic department regarding the GEF course offerings available at their campus.
Degree Requirements
Student must complete the WVU General Education Foundations requirements, College B.S. requirements, major requirements, and electives to total a minimum of 120 hours. For complete details on these requirements, visit the B.S. Degrees tab on the Eberly College of Arts and Sciences.
Departmental Requirements for the B.S. in Data Science
- Capstone Requirement: The university requires the successful completion of a Capstone course. Data Science majors must complete DSCI 480.
- Writing and Communication Skills Requirements: Data Science Bachelor of Science students fulfill the Writing and Communication Skills requirement by completing ENGL 101 and ENGL 102 (or ENGL 103), and two additional SpeakWrite Certified CoursesTM:
- Calculation of the GPA in the Major: A minimum GPA of 2.5 across all classes applied to the Data Science Major Requirements is required. If a class is repeated, the second attempt will be included in the calculation of the GPA, unless it is subject to the D/F repeat policy.
- Advanced Coursework: As part of the major requirements, and in connection with their advisor, students will complete additional upper division coursework in a concentration of their choosing. Nine of the twelve credit hours must be at the 300-level or above.
- Benchmark Expectations: For details, for the the Data Science Degree Progress tab.
Curriculum Requirements
University Requirements | 31 | |
ECAS B.S. Requirements | ||
Departmental Requirements | 27 | |
Data Science Major Requirements | 62 | |
Total Hours | 120 |
University Requirements
General Education Foundations (GEF) 1, 2, 3, 4, 5, 6, 7, and 8 (31-37 Credits) | ||
Outstanding GEF Requirements 1, 4, 5, 6, and 7 | 18 | |
DSCI 191 | First-Year Seminar | 1 |
General Electives | 12 | |
Total Hours | 31 |
ECAS Bachelor of Science Requirements
Global Studies & Diversity Requirement | ||
Mathematics Requirements | ||
Fulfilled by major requirement | ||
Science Requirement | ||
Please see the Eberly College of Arts and Sciences' Bachelor of Science (B.S.) tab. |
Departmental Requirements
STEM FOUNDATIONS: | ||
Basic Core Requirements: | 19 | |
Calculus 1a with Precalculus and Calculus 1b with Precalculus | ||
or MATH 155 | Calculus 1 | |
Calculus 2 | ||
Introduction to Computer Science and Introduction to Data Structures | ||
Introduction to Probability and Statistics | ||
Data Science Foundational Science Requirement | 8 | |
Select one pair: | ||
Principles of Biology and Principles of Biology Laboratory and Introductory Physiology and Introductory Physiology Laboratory | ||
OR | ||
Fundamentals of Chemistry 1 and Fundamentals of Chemistry 1 Laboratory and Fundamentals of Chemistry 2 and Fundamentals of Chemistry 2 Laboratory | ||
OR | ||
Principles of Chemistry 1 and Principles of Chemistry 1 - Laboratory and Principles of Chemistry 2 and Principles of Chemistry 2 - Laboratory | ||
OR | ||
Introductory Physics 1 and Introductory Physics 1 Laboratory and Introductory Physics 2 and Introductory Physics 2 Laboratory | ||
OR | ||
General Physics 1 and General Physics 1 Laboratory and General Physics 2 and General Physics 2 Laboratory | ||
Total Hours | 27 |
Data Science Major Requirements
Mathematics Core: | 13 | |
Multivariable Calculus | ||
Introduction to the Concepts of Mathematics | ||
Discrete Mathematics | ||
or MATH 420 | Numerical Analysis 1 | |
Applied Linear Algebra | ||
Statistics Core: | 6 | |
Intermediate Statistical Methods | ||
Data Analysis | ||
Computer Science Core: | 6 | |
Analysis of Algorithms | ||
Databases for Data Science | ||
Data Science Core: | 22 | |
Introduction to Data Science | ||
Reproducible Data Science using R | ||
Data Science Workflows using Python | ||
Statistical Machine Learning 1 | ||
Statistical Machine Learning 2 | ||
Big Data in Practice: Cloud and Parallel Computing | ||
Current Topics in Data Science | ||
Data Science Advanced Electives | 12 | |
In consultation with an advisor, students will complete a concentration in a discipline of their choice such as Sociology, Geography, Biology or others. Students are welcome to propose concentrations that draw on their interests from the humanities, social sciences, or STEM fields where big data are collected and analyzed to provide new insights | ||
Capstone | 3 | |
Capstone in Data Science | ||
Total Hours | 62 |
Suggested Plan of Study
First Year | |||
---|---|---|---|
Fall | Hours | Spring | Hours |
DSCI 191 | 1 | DSCI 221 | 4 |
DSCI 101 | 3 | CS 111 (B.S. First Area 2) | 4 |
CS 110 (B.S. First Area 1) | 4 | MATH 156 (B.S. Second Area 1 Course 1; F8) | 4 |
MATH 155 (F3) | 4 | F5 | 3 |
F4 | 3 | ||
15 | 15 | ||
Second Year | |||
Fall | Hours | Spring | Hours |
DSCI 222 | 3 | DSCI 301 | 3 |
STAT 215 (F8 course 2) | 3 | MATH 441 | 3 |
MATH 303 | 3 | STAT 312 | 3 |
MATH 251 (B.S. Second Area 2) | 4 | GEF 6 | 3 |
DSCI Foundational Science Elective (B.S. Third Area 1; F2) | 4 | DSCI Foundational Science Elective 1 (B.S. Third Area 2; F8 course 3) | 4 |
17 | 16 | ||
Third Year | |||
Fall | Hours | Spring | Hours |
DSCI 310 | 3 | DSCI 311 | 3 |
STAT 445 | 3 | MATH 378 | 3 |
CS 320 | 3 | ECAS Global Studies and Diversity Requirement (F 7) | 3 |
ENGL 101 (GEF 1) | 3 | ENGL 102 (GEF 1) | 3 |
DSCI Advanced Science Elective 1 | 3 | DSCI Advanced Science Elective 2 | 3 |
15 | 15 | ||
Fourth Year | |||
Fall | Hours | Spring | Hours |
DSCI 410 | 3 | DSCI 480 | 3 |
DSCI 450 | 3 | Advanced Data Science Elective 4 | 3 |
DSCI Advanced Science Elective 3 | 3 | General Elective | 3 |
General Elective | 3 | General Elective | 3 |
General Elective | 3 | ||
15 | 12 | ||
Total credit hours: 120 |
Degree Progress
- By the beginning of a student's third regular semester (fall or spring), they should have completed either MATH 154 or MATH 155 with a C- or better.
- During the first four regular semesters (fall and spring) in the major, student must complete their foundational mathematics courses through MATH 441, CS 110 and CS 111, and DSCI 101, DSCI 221, and DSCI 222.
- A minimum cumulative and major GPA of a 2.0 must be maintained. Students who do not meet this benchmark will be removed from the major.
Major Learning Outcomes
Data Science
Learning Outcome 1: Students will communicate data science workflows in both written and oral forms.
Outcome 1.1 Students will demonstrate their ability to develop and use appropriate data science techniques to address ‘science’ (subject matter) topics and questions.
Outcome 1.2 Students will communicate the biases and other implications of the data and analysis.
Outcome 1.3 Students will prepare a clear and concise written project and orally present a data science workflow and analysis effectively and professionally.
Learning Outcome 2: Students will understand and demonstrate the programming and technological aspects of a data science workflow
Outcome 2.1 Students will develop workflows using the languages and platforms common in data science practice (eg. R and Python, Rstudio and JupyterLab)
Outcome 2.2 Students will demonstrate their ability to acquire and manipulate data via a variety of platforms (eg. databases to cloud computing)
Outcome 2.3 Students will demonstrate their ability to use technologies for collaboration (eg. Git and GitHub)
Learning Outcome 3: Students will demonstrate their ability to visualize and model data
Outcome 3.1 Students will demonstrate visualization of data from simple plots for smaller data sets to visualizations for big data
Outcome 3.2 Students will demonstrate their ability to use current machine learning and other data science modeling methods appropriately and understand the underlying statistical and mathematical concepts.
DSCI 101. Introduction to Data Science. 3 Hours.
Introduction and overview of this interdisciplinary field and the skills needed to work as a data scientist. Provides students basic experience in acquiring data, performing very simple analyses, and gaining an elementary understanding of data science.
DSCI 191. First-Year Seminar. 1-3 Hours.
Engages students in active learning strategies that enable effective transition to college life at WVU. Students will explore school, college and university programs, policies and services relevant to academic success. Provides active learning activities that enable effective transition to the academic environment. Students examine school, college and university programs, policies and services.
DSCI 209. Data Science Pipelines with Python and R. 3 Hours.
PR: DSCI 101 with a minimum grade of C- and MATH 124 or higher (up to MATH 156) with a minimum grade of C-. Development of workflow or computer programs to import, clean, transform, model and visualize data. Using data from different disciplines, students will program in Python and R as they develop these data science pipelines and present their results.
DSCI 221. Reproducible Data Science using R. 4 Hours.
PR: DSCI 101 and CS 110 with a minimum grade of C- in each. Introduction to programming in R and to using RStudio, and using the tidyverse set of packages to learn the basics of a data science pipeline needed to import, clean, transform, visualize and model large amounts of data.
DSCI 222. Data Science Workflows using Python. 3 Hours.
PR: DSCI 221 with a minimum grade of C-. Continuation of DSCI 221. Introduction to programming in Python, to the basics of building a data science pipeline. Students develop projects using data from various sources to develop and refine their Python skills. Also teaches the basics of terminal mode and use of bash.
DSCI 301. Databases for Data Science. 3 Hours.
PR: DSCI 209 with a minimum grade of C-. Focuses on understanding relational or categorical data structures associated with databases in a data science pipeline and acquiring data from existing databases using R and Python.
DSCI 309. Applied Machine Learning. 3 Hours.
PR: DSCI 209 with a minimum grade of C-. Statistical machine learning methods for supervised and unsupervised learning will be introduced via applications. Specifically, linear regression, methods for classification, resampling, model choice, dimension reduction and clustering will be covered with a conceptual understanding and their implementation using R and Python.
DSCI 310. Statistical Machine Learning 1. 3 Hours.
PR: DSCI 222 and STAT 312 and MATH 441 with a minimum grade of C- in each. Focuses on a conceptual understanding of the methods and their implementation using R and Python. Covers linear regression; classification methods (logistic regression, linear discriminant analysis and K-nearest neighbors); resampling methods (cross-validation and bootstrap); model choice methods (subset and stepwise selection, shrinkage methods); dimension reduction methods (principal components analysis).
DSCI 311. Statistical Machine Learning 2. 3 Hours.
PR: DSCI 310 with a minimum grade of C-. Continuation of DSCI 310. Covers statistical machine learning methods that are not strictly linear, such as models based on splines, tree-structures, support vector machines and unsupervised methods. Emphasizes a conceptual understanding and application of the methods using R and Python.
DSCI 409. Advanced Case Studies in Data Science. 3 Hours.
PR: DSCI 309 with a minimum grade of C-. The course covers advanced methods through case studies. Four main topics will be computing and analyzing data using the high performance computing, and case studies with natural language processing, real-time streaming data and imaging data. Concurrently students will acquire data from their own major and put together a data science pipeline and analysis for their final project.
DSCI 410. Big Data in Practice: Cloud and Parallel Computing. 3 Hours.
PR: DSCI 311 with a minimum grade of C-. Extends the R “tidyverse” data manipulation and machine learning pipelines to relational database tables; big data; network data; streaming data. Students will develop their abilities from using RStudio locally on a laptop to using it on a server, with technologies such as Spark.
DSCI 450. Current Topics in Data Science. 3 Hours.
PR: DSCI 311 with a minimum grade of C-. Exploration of timely current topics where data science is used; exploration and discussion of biases and other aspects of decisions made as a result of data science tools.