Statistics

http://www.stat.wvu.edu

Degrees Offered

  • Master of Science

Nature of the Program

The Department of Statistics offers a Master of Science (M.S.) in Statistics.   The department also offers a Certificate in Applied Statistics (C.A.S.). The M.S. degree is intended to qualify the student to assume a professional role in educational, industrial, or governmental research projects; to teach in a college; or to undertake advanced training toward a doctorate in statistics or one of the quantitative fields of science.

Because many students receive baccalaureate degrees from colleges that do not offer undergraduate programs in statistics, and because historically statistics has been primarily a field of graduate education, a student does not need a degree in statistics to enter the degree programs in statistics. A good background in mathematics, science, or engineering is reasonable preparation for graduate work in statistics.

The Department of Statistics also participates in the Combinatorial Computing and Discrete Mathematics (CCDM) Area of Emphasis within the Computer and Information Science Ph.D. Program or the Mathematics Ph.D. Program.

Admissions and Prerequisites for Master of Science in Statistics

Students are expected to know the material contained in the following courses or areas upon admission to the program. Otherwise, these deficiencies must be removed as early as possible in the student’s degree program under the terms specified by the Admissions and Standards Committee.

  • Single and multivariable calculus (MATH 155, MATH 156, MATH 251, or equivalent)
  • Linear or matrix algebra (MATH 441 or equivalent)
  • Probability and statistics (STAT 215 or equivalent)
  • Knowledge of a high-level programming language

 

Admissions and Prerequisites for the Certificate in Applied Statistics

Students can apply for the Certificate in Applied Statistics during the time, or prior to the time, they are taking STAT 511 or STAT 512. Students who have taken courses beyond these two basic courses can petition the Academic Standards Committee for acceptance into the program.

The prerequisite for admission is a college algebra course. Single and multi-variable calculus are recommended but not required for admission. However, certain elective courses have calculus prerequisites. Applicants must have a baccalaureate degree.

Beyond the above mathematical entrance requirements, the Admissions and Standards Committee will base admission on the following material submitted by the applicant:

  • Resume or curriculum vitae
  • Transcript

The GRE General Test is not required for admission.

To obtain a Master of Science in Statistics, the student must complete the course and comprehensive examination requirements.  The student must maintain a minimum GPA of 3.0 and earn a grade of C or better in all courses counting toward the degree.

Master of Science

Major Requirements

To obtain a Master of Science in Statistics, the student must complete the course and comprehensive examination requirements.

Minimum cumulative GPA of 3.0 is required.
Minimum grade of C- in all courses applied toward the degree
STAT 512Statistical Methods 23
STAT 513Design of Experiments3
STAT 545Applied Regression Analysis3
STAT 555Categorical Data Analysis3
STAT 561Theory of Statistics 13
STAT 562Theory of Statistics 23
Electives (STAT 462, any 500-, 600-, or 700-level STAT courses except STAT 511 or STAT 516) *9
Select either the non-thesis or thesis option6
Non-Thesis Option:
Electives (STAT 462, any 500-, 600-, or 700-level STAT courses except STAT 511 or STAT 516) *
Thesis Option:
Research
Thesis
Thesis Defense
Comprehensive Examination
Total Hours33

* Non-STAT electives require departmental consent.

Examinations

Students must pass a written comprehensive examination on foundational material. The examination covers the theory taught in STAT 461 and STAT 462 and the applications taught in STAT 512STAT 513, and STAT 545. The exam is given twice a year on the Thursday during the second full week following spring semester final exams and on the third Saturday in October. Students have a maximum of three attempts for this exam.

Certificate in Applied Statistics

The Certificate in Applied Statistics (C.A.S.) can be earned through traditional classroom delivery as well as distance-based delivery. Many of the courses are offered both on-campus and online, including the required courses and the SAS and Data Science tracks.

The C.A.S. is designed for professionals or students who want applied statistics and data science courses to enhance their job opportunities and quantitative skills. If prerequisite requirements are met, theoretical courses in statistics are available as an option. This certificate program is designed for both on and off-campus delivery.
This certificate program will provide students with a solid foundation in statistical methods, predictive analytics, statistical computing, and data technologies. This program will enhance the quantitative skills of professionals or the research productivity of graduate students.
The C.A.S. is based on a coherent group of courses representing a specialized area of knowledge. To meet this objective each student must choose a track reflecting his or her interests. Certain tracks, e.g., the SAS, Data Science, and Modeling tracks, are offered as on and off-campus options. Other tracks are residential, although certain courses within these tracks may be offered in a distance-based format.

To maintain their status and to receive a Certificate in Applied Statistics, students must maintain at least a 3.0 GPA in courses counted towards the certificate. Students falling below a 3.0 GPA for a semester on courses counting towards this degree will be notified, and they will have one year to raise their GPA to a 3.0.

Required Courses

Students in the certificate program must complete at least fifteen credit hours. The Certificate in Applied Statistics should represent a distinct area of study, which builds on the foundational required statistics courses. The distinct areas of study are: SAS, data science, modeling, applied statistics, and mathematical statistics. The courses required for the completion of each of these areas are defined below.

STAT 511Statistical Methods 13
STAT 512Statistical Methods 23
Three courses from one of the tracks listed below9
Total Hours15

Track options and courses

SAS TRACK:

STAT 521Statistical Analysis System Programming3
STAT 522Advanced Statistical Analysis System Programming3
STAT 540Introduction to Exploratory Data Analysis3

Data Science Track:

STAT 523Statistical Computing3
STAT 623Data Technologies3
STAT 624High Performance Analytics3

Modeling Track: 

STAT 541Applied Multivariate Analysis3
STAT 545Applied Regression Analysis3
STAT 547Survival Analysis3

Applied Statistics Track: 

STAT 513Design of Experiments3
STAT 531Sampling Theory and Methods3
STAT 551Nonparametric Statistics3

Mathematical Statistics Track:

STAT 516Forensic Statistics3
STAT 561Theory of Statistics 13
STAT 562Theory of Statistics 23
Total Hours9


Customized track:

A student can propose a customized track to the Admissions and Standards Committee for approval based on three cohesive courses selected from 500 or 600-level courses taught by the Department of Statistics or two courses taught by the Department of Statistics and one statistics course taught by another department.

Capstone experience

The certificate capstone will test the student's ability to apply statistical methods to a real dataset using statistical computing tools. The student is required to develop a reproducible report including a brief description of the methodologies used, the results, and the conclusions. The report is limited to ten pages. The dataset type and methods used will depend on the track the student has chosen.

The capstone experience will be administered by the student's advisor. The student will be given five days to complete the report at a date mutually agreed upon by the student and his or her advisor at or near the end of the student's coursework. The advisor will determine whether the student passes or fails based on an agreed upon rubric. A student who fails will receive written information on the reasons for failure and be given three additional days to improve his or her report to the standards required for passing.

Major Learning Goals

statistics

Graduate courses in statistics, and sequences of statistics courses leading to a Master of Science in Statistics or a Certificate in Applied Statistics, provide a foundation of statistical literacy, statistical reasoning, and statistical thinking.  Our aim is for all of our students to be challenged and encouraged in their statistical course work.  In particular, we enable our students to

  • Appreciate the inherent variation and uncertainty of information, and understand that statistics can be a resource for improved decision making;
  • Develop critical thinking skills for application of statistics;
  • Effectively communicate the results of statistical analysis;
  • Become responsible and competent practitioners of statistics in order to attain personal goals, either in a profession or in further educational experiences.

Courses

STAT 505. Foundations of Probability and Statistics. 3 Hours.

PR:MATH 156 or consent. Probability, random variables, discrete and continuous probability distributions, point and interval estimation, chi-square tests, linear regression, and correlation.

STAT 511. Statistical Methods 1. 3 Hours.

PR: MATH 126. Statistical models, distributions, probability, random variables, tests of hypotheses, confidence intervals, regression, correlation, transformations, F and Chi-square distributions, analysis of variance and multiple comparisons. (Equivalent to EDP 613 and PSYC 511.).

STAT 512. Statistical Methods 2. 3 Hours.

PR: STAT 511 or equivalent. Completely random, randomized complete block, Latin square, and split-plot experimental designs. Unplanned and planned multiple and orthogonal comparisons for qualitative and quantitative treatments and factorial arrangements. Multiple linear regression and covariance analysis. (Equivalent to EDP 614 and PSYC 512.).

STAT 513. Design of Experiments. 3 Hours.

PR: STAT 512 or equivalent. Expected mean squares, power of tests and relative efficiency for various experimental designs. Fixed, random, and mixed models. Use of sub-sampling, covariance, and confounding to increase power and efficiency.

STAT 516. Forensic Statistics. 3 Hours.

PR: STAT 215 or equivalent. Probabilistic and statistical evaluation of evidence in forensic science: concepts of uncertainty variation, discriminating power, coincidence/significance probabilities, historical overview, transfer evidence, DNA profiling, fingerprint identification, biometric identification, and case studies.

STAT 521. Statistical Analysis System Programming. 3 Hours.

PR: STAT 511 or equivalent. Topics in Statistical Analysis System (SAS). Students perform statistical data analyses, data modifications and manipulations, file operations, and statistical report writing. Prepares students for the SAS Base Programming certification exam.

STAT 522. Advanced Statistical Analysis System Programming. 3 Hours.

PR: STAT 521 or consent. Advanced topics in Statistical Analysis System (SAS); SAS SQL to generate reports, join tables, construct queries; SAS Macrolanguage basics; write/implement SAS macro programs. Prepares students for SAS Advanced Programmer Certification Exam.

STAT 523. Statistical Computing. 3 Hours.

PR: STAT 512. Monte Carlo methods; randomization, partitioning, and the bootstrap; identifying data structures, estimating functions, including density functions; statistical models of dependencies. R programming.

STAT 525. Statistical Graphics. 3 Hours.

PR: STAT 512. Introduction to R graphics; traditional graphs; the grid graphics model; lattice graphics; developing new graphics functions and objects in R. Visualizing large datasets.

STAT 531. Sampling Theory and Methods. 3 Hours.

PR: STAT 511 or consent. Survey components, methods of sampling for finite and infinite populations, single and multi-stage procedures, confidence limits for estimating population parameters, sample size determination, area sampling sources of survey error, and basic inference derived from survey design.

STAT 540. Introduction to Exploratory Data Analysis. 3 Hours.

PR: An introductory statistics course. Basic ways in which observations given in counted and measured form are approached. Pictorial and arithmetic techniques of display and discovery. Methods employed are robust, graphical, and informal. Applications to social and natural sciences. (Alternate years.).

STAT 541. Applied Multivariate Analysis. 3 Hours.

PR: STAT 511 or equivalent. Introduction to Euclidean geometry and matrix algebra; multiple and multivariate regression including multiple and canonical correlation; the k-sample problem including discriminant and canonical analysis; and structuring data by factor analysis, cluster analysis, and multi-dimensional scaling.

STAT 543. Bioinformatics Data Analysis. 3 Hours.

PR: STAT 512 or equivalent. Statistical analyses of high-throughput experiments using data visualization, clustering, multiple testing, classification and other unsupervised and supervised learning methods. Data processing, including background adjustment and normalization. Case studies.

STAT 545. Applied Regression Analysis. 3 Hours.

PR: STAT 512 or equivalent. Matrix approach to linear and multiple regression, selecting the "best" regression equation, model building, and the linear models approach to analysis of variance and analysis of covariance.

STAT 547. Survival Analysis. 3 Hours.

PR: STAT 512. Survival model methodology, including model selection for incomplete data with censored, truncated, and interval censored observations. Applications to many real life problems using R.

STAT 551. Nonparametric Statistics. 3 Hours.

PR: STAT 511 or equivalent. Distribution-free procedures of statistical inference. Location and scale tests for homogeneity with two or more samples (related or independent); tests against general alternatives.

STAT 555. Categorical Data Analysis. 3 Hours.

PR: STAT 512 or equivalent. Bivariate association for ordinal and nominal variables, models for categorical or continuous responses as a special case of generalized linear models, methods for repeated measurement data, exact small-sample procedures.

STAT 561. Theory of Statistics 1. 3 Hours.

PR: MATH 251. Probability and random variables, univariate and multivariate distributions, expectations, generating functions, marginal and conditional distributions, independence, correlation, functions of random variables, including order statistics, limiting distributions, and stochastic convergence.

STAT 562. Theory of Statistics 2. 3 Hours.

PR: STAT 561. Techniques of point and interval estimation; properties of estimates including bias, consistency, efficiency, and sufficiency; hypothesis testing including likelihood ratio tests and Neyman-Pearson Lemma; Bayesian procedures; analysis of variance and nonparametrics.

STAT 582. Statistical Consulting. 1 Hour.

PR: STAT 513 or Consent. Statistical consulting principles and procedures. The entire consulting experience, including design, models, communication skills, ethics, tracking, and documentation, is presented in a series of case studies, including student presentations and reports on assigned cases.

STAT 590. Teaching Practicum. 1-3 Hours.

PR: Consent. Supervised practice in college teaching of statistics. Note: This courses is intended to insure that graduate assistants are adequately prepared and supervised when they are given college teaching responsibility. It will also present a mechanism for students not on assistantships to gain teaching experience. (Grading may be S/U.).

STAT 591A-Z. Advanced Topics. 1-6 Hours.

PR: Consent. Investigation of advanced topics not covered in regularly scheduled courses.

STAT 593. Special Topics. 1-6 Hours.

A study of contemporary topics selected from recent developments in the field.

STAT 595. Independent Study. 1-6 Hours.

STAT 595. Independent Study. 1-6 HR. Faculty supervised study of topics not available through regular course offerings.

STAT 623. Data Technologies. 3 Hours.

PR: STAT 512 or consent. R data manipulation and processing. Topics include: R operators, functions, data structures, and objects; R data input and output, package development, and text processing; R interfaces to XML and SQL databases.

STAT 624. High Performance Analytics. 3 Hours.

PR: STAT 623. High performance and data-stream computing using R. Topics include: parallel R packages; Hadoop clusters; MapReduce R scripting; shared R network spaces; beyond-memory data analysis; data-stream modeling and visualization.

STAT 641. Multivariate Statistical Theory. 3 Hours.

PR: STAT 541, and STAT 561 or consent. Euclidean vector space theory and matrix algebra, multivariate normal sampling theory, the theory of the multivariate general linear hypothesis including multivariate regression, MANOVA, and MANCOVA, and the theory of factor analysis.

STAT 645. Linear Models. 3 Hours.

PR: STAT 545 and (STAT 462 or STAT 562) or consent. Multivariate normal distribution, distribution of quadratic forms, linear models, general linear hypotheses, experimental design models, components of variance for random effects models.

STAT 682. Statistics Practicum. 1 Hour.

PR: STAT 582. Statistical consulting on university-related research projects under the direction of a statistics faculty member.

STAT 689. Professional Field Experience. 1-6 Hours.

PR: Consent. (May be repeated up to a maximum of 18 hours). Prearranged experiential learning program, to be planned, supervised, and evaluated for credit by faculty and field supervisors. Involves temporary placement with public or private enterprise for professional competence development.

STAT 690. Teaching Practicum. 1-3 Hours.

PR: Consent. Supervised practice in college teaching of statistics. Note: This course is intended to insure that graduate assistants are adequately prepared and supervised when they are given college teaching responsibility. It also provides a mechanism for students not on assistantships to gain teaching experience. (Grading may be S/U.).

STAT 691A-Z. Advanced Topics. 1-6 Hours.

PR: Consent. Investigation of advanced topics not covered in regularly scheduled courses.

STAT 692. Directed Study. 1-6 Hours.

Directed study, reading, and/or research.

STAT 693. Special Topics. 1-6 Hours.

A study of contemporary topics selected from recent developments in the field.

STAT 694. Seminar. 1-6 Hours.

Special seminars arranged for advanced graduate students.

STAT 695. Independent Study. 1-6 Hours.

Faculty supervised study of topics not available through regular course offerings.

STAT 696. Graduate Seminar. 1 Hour.

PR: Consent. Each graduate student will present at least one seminar to the assembled faculty and graduate student body of his or her program.

STAT 697. Research. 1-15 Hours.

PR: Consent. Research activities leading to thesis, problem report, research paper or equivalent scholarly project, or a dissertation. (Grading may be S/U.).

STAT 698. Thesis or Dissertation. 1-6 Hours.

PR: Consent. This is an optional course for programs that wish to provide formal supervision during the writing of student reports (698), or dissertations (798). Grading is normal.

STAT 699. Graduate Colloquium. 1-6 Hours.

PR: Consent. For graduate students not seeking coursework credit but who wish to meet residency requirements, use of the University's facilities, and participate in its academic and cultural programs. Note: Graduate students who are not actively involved in coursework or research are entitled, through enrollment in their department's 699/799 Graduate Colloquium to consult with graduate faculty, participate in both formal and informal academic activities sponsored by their program, and retain all of the rights and privileges of duly enrolled students. Grading is P/F; colloquium credit may not be counted against credit requirements for masters programs. Registration for one credit of 699/799 graduate colloquium satisfies the University requirement in the semester in which graduation occurs.

STAT 745. Data Mining. 3 Hours.

PR: STAT 545 or equivalent. Development of predictive models for large datasets, including logistic and linear models, regression and classification trees, and neural networks. Data preparation, including imputation and filtering.

STAT 761. Theoretical Statistics 1. 3 Hours.

PR: STAT 562 or consent. Advanced statistical theory including: consistent estimators; limiting distributions; asymptotic properties; goodness-of-fit tests; maximum likelihood estimation, moment generating functions; properties of statistical tests and procedures for finite-dimensional and infinite-dimensional spaces.

STAT 762. Theoretical Statistics 2. 3 Hours.

PR: STAT 761. Continuation of STAT 761 including: asymptotic optimality, contiguity of probability measures, local asymptotic normality of likelihood ratio test, Bayesian estimation, general linear models estimation and testing, and kernel smoothing methods in density and regression estimation.

STAT 763. Stochastic Processes. 3 Hours.

PR: STAT 561. Modeling of random phenomenon occurring over time, space, or time and space simultaneously. Modern techniques, such as the martingale decomposition, are applied to different statistical models.

STAT 765. Statistical Methods-Bioinformatics. 3 Hours.

PR: STAT 561. Constructions of probabilistic models describing biological DNA and protein sequence data. Investigation of asymptotic properties of various test statistics.

STAT 791A-Z. Advanced Topics. 1-6 Hours.

PR: Consent. Investigation of advanced topics not covered in regularly scheduled courses.

STAT 797. Research. 1-15 Hours.

PR: Consent. Research activities leading to thesis, problem report, research paper or equivalent scholarly project, or a dissertation. (Grading will be S/U).


Faculty

Chair

  • Michael Mays - Ph.D. (Penn State University)

Professors

  • E. James Harner - Ph.D. (Cornell U.)
    Dynamic graphics, Statistical computing and modeling, Statistical education.
  • Erdogan Gunel - Ph.D. (State University of New York, Buffalo)
    Bayesian Inference, Biostatistics, Categorical Data Analysis
  • Robert Mnatsakanov - Ph.D. (Moscow State Institute of Electronics and Mathematics)
    Nonparametric statistics, Statistical Inverse Problems, Mixture Models, Change-set Problems

Associate Professors

  • Mark V. Culp - Ph.D. (University of Michigan)
    Statistical Machine Learning, Computational Statistics, Semi-supervised and Multi-view Learning, Biometrics
  • Gerald R. Hobbs Jr. - Ph.D. (Kansas State University)
    Biostatistics, Nonparametric Statistics, Regression Analysis
  • Kenneth J. Ryan - Ph.D. (Iowa State University)
    Experimental Design, Statistical Machine Learning, Biometrics

Assistant Professors

  • Stacey Culp - Ph.D. (University of Michigan)
    Statistics Education and Statistical Consulting
  • Casey Jelsema - Ph.D. (Western Michigan University)
    Spatial statistics, mixed effects models, Bayesian hierarchical modeling, constrained inference, bootstrap methods, environmental statistics, microbiome, statistical computation.
  • Erin R. Leatherman - Ph.D. (Ohio State)
    Prediction and Design for Computer and Physical Experiments.

Teaching Associate Professor

  • Huey Miin Lee - Ph.D. (Johns Hopkins University)
    Bioinformatics, Statistical Education

Teaching Instructor

  • Anthony Billings - M.S. (West Virginia University, A.B.D. (Carnegie Mellon University)
    Statistical Computing, Statistical Modeling, Robust Estimation, Nonlinear Dynamic Systems, Statistical Education

Professor Emeritus

  • William V. Thayne - Ph.D. (University of Illinois)
    Experimental Design, Statistical Genetics, Regression Analysis
  • Edwin C. Townsend - Ph.D. (Cornell University)
    Experimental Design, Regression Analysis

Associate Professor Emeritus

  • Daniel M. Chilko - M.S. (Rutgers University)
    Statistical Computing, Computer Graphics