**Course title:** **Machine Learning**

**Course number:** COMP.5450 (graduate), COMP.4220 (undergraduate)

**Semester:** Spring 2018

**Location:** Olsen Hall 503

**Meeting Times:** Tue., 5:30PM-8:15PM

**Instructor:** **Dr. Jerome J. Braun**, jbraun@cs.uml.edu, jerome_braun@uml.edu (Note: **students** who need to email me should write only to my **jerome_braun@uml.edu** address)

**TA:** Xinzi Sun, Xinzi_Sun@student.uml.edu

**Course overview**

This course provides fundamentals of Machine Learning and its algorithmic techniques for pattern recognition, Artificial Intelligence (AI), data science, and other applications. Topics include foundations of machine learning comprising statistical and probabilistic methods, generative and discriminative models, linear regression, Bayesian methods, parametric and non-parametric classification, supervised and unsupervised learning, clustering and dimensionality reduction, and applications to large datasets.

**Prerequisites**

Pre-Reqs: COMP 1020 Computing II, MATH 3220 Discrete Structures II and MATH 3860 Probability & Statistics I.

**Please note:**

This is a graduate/upper-level undergraduate course. All students should be familiar with fundamentals of probability, linear algebra, and calculus. All must also be familiar with algorithmic and programming techniques needed for the implementation of machine-learning algorithms that will be taught in this course.

The above proficiencies are assumed typical for students who have successfully completed the following courses (or their equivalents): Discrete Structures II (or Linear Algebra), Probability & Statistics I, Computing II.

Students enrolled in the graduate section are expected to have proficiency in probability and linear algebra commensurate with their graduate status; such proficiency is assumed typical for students who have completed Linear Algebra I and II and Calculus III.

**“Quiz 0” on prerequisite math knowledge** (including Probability, Calculus, and basic Linear Algebra/Matrix Algebra) may, and most likely will, be administered early in the course. This quiz will test mathematical skills necessary for the course. If administered, it is also intended to help students self-evaluate their prior knowledge of the mathematical fundamentals expected in this course. In all cases, the responsibility to have or to acquire the mathematical skills needed for this course rests with the students. If administered, “Quiz 0” must be taken by all students in this course at the date and time of its administration.

**Textbooks**

Required textbook:

**Bishop, C. M.,** ‘*Pattern Recognition and Machine Learning*,’ Springer, 2006.

Other recommended books:

- Koller, D. and Friedman, N., ‘*Probabilistic Graphical Models*,’ MIT Press, 2009.
- Hastie, T., Tibshirani, R. and Friedman, J., ‘*The Elements of Statistical Learning*,’ Springer, 2001.
- Theodoridis, S. and Koutroumbas, K., ‘*Pattern Recognition*,’ Academic Press.
- Duda, R.O., Hart, P.E., and Stork, D.G., ‘*Pattern Classification*,’ Wiley-Interscience, 2001.
- Russell, S. and Norvig, P., ‘*Artificial Intelligence: A Modern Approach*,’ Prentice Hall Series in Artificial Intelligence, 2003.
- Schölkopf, B. and Smola, A., ‘*Learning with Kernels*,’ MIT Press, 2002.
- Vapnik, V., ‘*The Nature of Statistical Learning Theory*,’ Springer.
- Goodfellow, I., Bengio, Y., and Courville, A., ‘*Deep Learning*,’ MIT Press, 2016.

**Topics Overview / Schedule (tentative)**

- Introduction to Machine Learning. Review of mathematical concepts.
- Probabilistic Approaches. Maximum Likelihood principle. Linear Regression, Bayesian regression.
- Classification task, Generative vs. Discriminative approaches, Logistic Regression, Linear Discriminant Analysis (LDA).
- Regularization, Bias-variance trade-off, Bayesian Models.
- Artificial Neural Networks.
- Practical advice for applying learning algorithms: debugging, feature/model design, performance evaluation methods.
- Statistical Learning Theory (VC theory).
- Support Vector Machines (SVMs).
- Probabilistic Graphical Models and Hidden Markov Models (HMM).
- Combining Models: Bagging and Boosting.
- Unsupervised learning: clustering.
- Introduction to deep learning and large-scale machine learning.

**Course Outcomes**

After successfully completing this course, the students should be able to understand:

- Design, construction and evaluation of a machine-learning (pattern-recognition) system.
- Mathematical foundations of several mainstream machine-learning algorithms.
- Major classes of approaches in machine-learning (pattern-recognition) literature.
- Selected theoretical issues involved in machine-learning algorithm design.
- Implementation of machine-learning techniques in a scientific-computing environment.

**Grading**

Students will be evaluated based on the following:

- Midterm quiz/exam (“Quiz 1”), administered in-class: 16% of total course-grade.
- Final quiz/exam (“Quiz 2”), administered in-class: 27% of total course-grade.
  - Should a final exam (43%) or a midterm exam (43%) be announced *in lieu* (*instead*) of Quizzes/exams 1 and 2 (an **unlikely** situation), such an announcement would be made in advance.
- “Quiz 0”, administered in-class: 4% of total course-grade.
  - In the unlikely event of “Quiz 0” not being administered, the above 4% would be shifted to the weight of “Quiz 1” or “Quiz 2” or distributed between them. However, if administered, this shifting of weight will **not** be an option (i.e., if administered, “Quiz 0” will be **non**-optional and will be required to be taken by all students).
- Homework: homework problem-sets, *up to* 4 assignments: 8% total.
  - Self-graded; must hand in completed homework and grade; self-assigned grades will be verified by us at random.
  - Late-homework Policy: 50% off any assignment handed in up to one week after its due date (i.e., by the beginning of the following week’s class); 100% off (no credit) afterwards.
- Final Project: 40%.
  - To be performed as teamwork by self-organized teams of, generally, up to three students enrolled in this course.
  - Final-project deliverables required from each project team include (but are not limited to) final project-paper/report and final oral presentation.
- Class attendance and participation: 5%.

Participation in all administered exams/quizzes is required. In particular, non-participation or receiving a zero-score in the midterm exam/quiz or in the final exam/quiz will normally result (regardless of any other grade-components, e.g., scores on other quizzes/exams, final project, etc.) in receiving the course-grade F, i.e., in failing the course.
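As an illustration only, the weighting described above can be sketched in a few lines of Python. The function and variable names are hypothetical, scores are assumed to be fractions between 0 and 1, and lateness is approximated in calendar days (the syllabus defines the cutoff as the beginning of the following week's class). The sketch does not model letter-grade cutoffs, which are not specified here, and the exam rule is only a check for the case that "will normally" lead to an F.

```python
# Component weights in percent of the total course grade, as listed above.
WEIGHTS = {
    "quiz0": 4,
    "quiz1": 16,       # midterm quiz/exam
    "quiz2": 27,       # final quiz/exam
    "homework": 8,
    "project": 40,
    "attendance": 5,
}


def late_homework_factor(days_late):
    """Credit multiplier under the late-homework policy.

    Assumption: "up to one week after its due date" is modeled as 7 days.
    """
    if days_late <= 0:
        return 1.0   # on time: full credit
    if days_late <= 7:
        return 0.5   # up to one week late: 50% off
    return 0.0       # later than one week: no credit


def course_score(scores):
    """Weighted total in percent; each score is a fraction in [0, 1]."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def triggers_exam_rule(scores):
    """True if the midterm or final score is zero (or the exam was missed)."""
    return scores["quiz1"] == 0 or scores["quiz2"] == 0


example = {
    "quiz0": 1.0, "quiz1": 0.5, "quiz2": 1.0,
    "homework": 1.0, "project": 1.0, "attendance": 1.0,
}
print(course_score(example))        # 92.0
print(triggers_exam_rule(example))  # False
```

The weights sum to 100, so a student with full marks on every component reaches 100%, and losing half of the midterm alone costs 8 percentage points.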

**Grading of COMP.5450 students**: Graduate course COMP.5450 is co-listed with the undergraduate course COMP.4220. Students enrolled in COMP.5450 (graduate course) will be expected to achieve higher scores (and/or submit additional work) to receive the same grade as students enrolled in COMP.4220 (undergraduate course).

**Policies**

**Students are expected to adhere to all policies of UMass Lowell and of the CS department. Please refer to descriptions of policies available on the UMass Lowell website.**

**Academic Honesty Policy**: Students are expected to honor and follow all CS department and UMass Lowell policies related to academic honesty and integrity. Cheating will not be tolerated: violators risk failing the course in addition to any actions taken by the university administration.

All work on quizzes/exams must be the student's own work.

The work on homework assignments must also be the student's own work, with the following exceptions: 1) hints provided by the TA or the instructor may be used, provided that after obtaining such hints the students perform the assignment on their own, and that having obtained hints is acknowledged in writing in the student's work; 2) forming study-groups is allowed (and encouraged) and students may engage in discussions related to homework assignments, provided that following such discussions students complete the homework assignment separately on their own (without referring/copying detailed notes from those discussions) and that occurrence of such discussions is acknowledged in writing on the homework assignment (however, doing a homework assignment together by more than one person is not permitted). Using homework solutions from any source, such as websites or past-year’s solutions obtained from any source, is not permitted.

**Project work:** The required final project (see Grading section above) is expected to be done as teamwork by self-organized teams of, generally, up to three students enrolled in this course. Accordingly, collaborative work within these project teams on matters specific to their respective final project is permitted and required (all members of the project team must contribute). Please note, however, that this permission applies to the final-project work only; it does not apply to any other gradable items (such as homework assignments and exams/quizzes).

**Important notice regarding email:**

- Any email you send to me should be sent **only** from your UMass Lowell student email account (i.e., “at student uml edu”). I will reply **only** to email messages you send from your UML student email address.
- **No email attachments:** If you email me, please note that I will NOT accept emails that contain any attachments. In general, there should be no need to include any attachments in your email messages to me. Unlikely exceptions to this rule would be on a one-time, case-by-case basis **AND** would require you to request (with an appropriate justification) **and obtain from me ahead of time** an explicit one-time permission to send a specific attachment. Any and all email messages you send me, or cc/copy me on, that contain any unexpected attachment will not be considered as having been received, will not be replied to, and **may be deleted and/or left unopened.**