**Professor Ke-Sheng Cheng (Email: rslab@ntu.edu.tw)**

RSLAB_BSE_NTU

No. 1, Section 4, Roosevelt Road

Bioenvironmental Syst. Eng., National Taiwan University

This course is designed to facilitate students with skills of R coding for data analysis and graphic presentation, while learning fundamental concepts of statistical methods. Students who want to register for this course should have taken an entry level statistics. Familiarity with R language is not required, although it will be helpful.

My philosophy of teaching this course is to design a list of problems, each with specific learning objectives (important concepts and theories), and to guide students to solve these problems through computer coding (using R) and in-class discussions. Such an approach will enable students to learn R quickly and gain insights into more complicated or abstract concepts of many statistical methods. Thus, this class will be conducted in an **interactive format** through the following arrangements:

- Subjects and statistical methods (
**SSMs**) to be covered will be fully explained in the beginning of the semester (the first three or four weeks). For each SSM, problems to be solved or tasks to be conducted will be given. - Students are grouped (based on their backgrounds or interests) into several groups. Each group will be assigned certain SSMs to study during the semester.
- Every week, two or more groups (depending on number of groups in the class) will present their progress and results in class, followed by discussions.
- Each group is expected to make several presentations during the semester. Hopefully, once in every 3 to 4 weeks.
- A final presentation is required for all groups in the final week.

Students will be evaluated based on their performance in the progress report and presentations, as well as their participations in class discussions.

Note: **Stochastic simulation** is an essential element of this class. Through stochastic simulation, students will observe realizations and have a better understanding of statistical theories.

**Weekly schedule of individual groups **

### SSM0 Water consumption prediction

*09-11-2018*

The monthly water consumption and payment by an institute over the last 10 years (Jan 2009 - Sep 2018) is listed in the EXCEL file HRI_Water.csv. You are asked to- Conduct statistical analyses on the data
- Predict the water consumption of Oct, Nov, and Dec 2018.

### SSM1 Drought index (SPI) calculation and spatiotemporal visualization

*09-11-2018*

Standardized Precipitation Index (SPI) is a measure of drought. You will learn how to calculate SPI using daily rainfalls of different rainfall stations and use the results to evaluate the spatiotemporal variation of drought occurrences.**Data to be used:****Daily rainfall data (04/01/1995 - 03/31/2007) at 50 rainfall stations****Location (latitude, longitude) and station ID of 50 rainfall stations**

**Expected results****For SPI calculation, use 1 ten-day-period (TDP) as the operation scale and the SPI should has a time resolution of 3 TDPs.****Show the spatiotemporal variation of SPI values**

### SSM2 Supervised classification - the multivariate Gaussian maximum likelihood classifier

*09-19-2018*- Simulation of 2-class, 2-feature Gaussian maximum likelihood classification.
- Confusion matrix
- Uncertainty assessment of classification accuracy

### SSM3 Stochastic simulation of bivariate gamma distribution

*11-14-2018***SERRA article**### SSM4 Image mosaic for multispectral remote sensing images

### SSM5 Quantile mapping for bias correction

### SSM6 Asymptotic distribution of the test statistic of the Kolmogorov-Smirnov test

### SSM7 Change detection using the Mann-Whitney-Pettitt (MWP) test

*01-09-2019***A Non-parametric Approach to the Change-point Problem (A.N. Pettitt, Journal of the Royal Statistical Society. Series C, 1979)**### SSM8 L-moment-ratio diagram (LMRD) for GOF test

*09-19-2018*

Establishing acceptance regions for L-moments basedgoodness-of-fit tests by stochastic simulation.*Journal of Hydrology*, Vol. 355, No.1-4, 49-62. (doi:10.1016/j.jhydrol.2008.02.023).**SSM9 Rejection method for random number generation**### SSM10 Rainfall-Runoff Modeling

**SSM11 Rainfall frequency analysis using annual maximum series (AMS) and event maximum series (EMS)**### SSM12 IDF Uncertainty - Bootstrap sampling

*10-31-2018***Hourly rainfall data of two rainfall stations in northern Taiwan**. (**Hourly_Rainfall_Data.zip**)- Extract annual maximum rainfalls of various durations (1, 2, 3, 6, 12, 24, 48 hours). These are known as the annual maximum series (AMS).
- Conduct goodness-of-fit test to choose the best probability distribution for rainfall frequency analysis.
- Determine distribution parameters by using the method of moments and method of L-moments.
- For a specific duration, calculate the design rainfall depths of 5, 10, 25, 50, 100 and 200-year return periods.
- Plot the Duration-Depth-Frequency (return period) curve and Intensity-Depth-Frequency (IDF) curve.
- Evaluate the results
- Investigate uncertainty of the IDF curve by bootstrapping from the annual maximum series.

**RSLAB - NTU**

**Prof. Ke-Sheng Cheng **

RSLAB_BSE_NTU

No. 1, Section 4, Roosevelt Road

Bioenvironmental Syst. Eng., National Taiwan University