Doctor of Philosophy with a major in Machine Learning

The Doctor of Philosophy with a major in Machine Learning program has the following principal objectives, each of which supports an aspect of the Institute’s mission:

  1. Create students who are able to advance the state of knowledge and practice in machine learning through innovative research contributions.
  2. Create students who are able to integrate and apply principles from computing, statistics, optimization, engineering, mathematics, and science to innovate and create machine learning models, and to apply those models to solve important real-world, data-intensive problems.
  3. Create students who are able to participate in multidisciplinary teams that include individuals whose primary background is in statistics, optimization, engineering, mathematics and science.
  4. Provide a high-quality education that prepares individuals for careers in industry, government (e.g., national laboratories), and academia in terms of knowledge, computational (e.g., software development) skills, and mathematical modeling skills.
  5. Foster multidisciplinary collaboration among researchers and educators in areas such as computer science, statistics, optimization, engineering, social science, and computational biology.
  6. Foster economic development in the state of Georgia.
  7. Advance Georgia Tech’s position of academic leadership by attracting high-quality students who would not otherwise apply to Tech for graduate study.

All PhD programs must incorporate a standard set of Requirements for the Doctoral Degree.

The central goal of the PhD program is to train students to perform original, independent research.  The most important part of the curriculum is the successful defense of a PhD Dissertation, which demonstrates this research ability.  The academic requirements are designed in service of this goal.

The curriculum for the PhD in Machine Learning is truly multidisciplinary, containing courses taught in nine schools across three colleges at Georgia Tech: the Schools of Computational Science and Engineering, Computer Science, and Interactive Computing in the College of Computing; the Schools of Aerospace Engineering, Chemical and Biomolecular Engineering, Industrial and Systems Engineering, Electrical and Computer Engineering, and Biomedical Engineering in the College of Engineering; and the School of Mathematics in the College of Sciences.

Summary of General Requirements for a PhD in Machine Learning

  • Core curriculum (4 courses, 12 hours). Machine Learning PhD students will be required to complete courses in four different areas: Mathematical Foundations, Probabilistic and Statistical Methods in Machine Learning, ML Theory and Methods, and Optimization.   
  • Area electives (5 courses, 15 hours).
  • Responsible Conduct of Research (RCR) (1 course, 1 hour, pass/fail).  Georgia Tech requires that all PhD students complete an RCR requirement that consists of an online component and in-person training. The online component is completed during the student’s first semester enrolled at Georgia Tech.  The in-person training is satisfied by taking PHIL 6000 or the in-house RCR course offered by the student’s academic program.
  • Qualifying examination (1 course, 3 hours). This consists of a one-semester independent literature review followed by an oral examination.
  • Doctoral minor (2 courses, 6 hours).
  • Research Proposal.  The purpose of the proposal is to give the faculty an opportunity to provide feedback on the student’s research direction, and to make sure the student is developing into an able communicator.
  • PhD Dissertation.

Almost all of the courses in both the core and elective categories are already taught regularly at Georgia Tech.  However, two core courses (designated in the next section) are being developed specifically for this program.  The proposed outlines for these courses can be found in the Appendix. Students who complete these required courses as part of a master’s program will not need to repeat the courses if they are admitted to the ML PhD program.

Core Courses

Machine Learning PhD students will be required to complete courses in four different areas. With the exception of the Foundations course, each of these area requirements can be satisfied using existing courses from the College of Computing or Schools of ECE, ISyE, and Mathematics.

Machine Learning core:

Mathematical Foundations of Machine Learning. This required course is the gateway into the program, and covers the key subjects from applied mathematics needed for a rigorous graduate program in ML. Particular emphasis will be put on advanced concepts in linear algebra and probabilistic modeling. This course is cross-listed between CS, CSE, ECE, and ISyE.

ECE 7750/ISYE 7750/CS 7750/CSE 7750 Mathematical Foundations of Machine Learning

Probabilistic and Statistical Methods in Machine Learning

Machine Learning: Theory and Methods.  This course serves as an introduction to the foundational problems, algorithms, and modeling techniques in machine learning.  Each of the courses listed below treats roughly the same material using a mix of applied mathematics and computer science, and each has a different balance between the two. 

Optimization.  Optimization plays a crucial role in both developing new machine learning algorithms and analyzing their performance.  The three courses below all provide a rigorous introduction to this topic; each emphasizes different material and provides a unique balance of mathematics and algorithms.

Electives

After core requirements are satisfied, all courses listed in the core not already taken can be used as (appropriately classified) electives.

In addition to meeting the core area requirements, each student is required to complete five elective courses, which are intended to provide breadth in ML. These courses must be chosen from at least two of the five subject areas listed below. In addition, students can use up to six special problems research hours to satisfy this requirement.

i. Statistics and Applied Probability: To build breadth and depth in the areas of statistics and probability as applied to ML.

ii. Advanced Theory: To build a deeper understanding of the foundations of ML.

iii. Applications: To develop breadth and depth in a variety of application domains impacted by ML.

iv. Computing and Optimization: To provide more breadth and foundation in areas of math, optimization and computation for ML.

v. Platforms: To provide breadth and depth in computing platforms that support ML and Computation.

  • CS 6421, Temporal, Spatial, and Active Databases
  • CS 6430, Parallel and Distributed Databases
  • CS 6290, High-Performance Computer Architecture
  • CSE 6220, High Performance Computing
  • CSE 6230, High Performance Parallel Computing

Qualifying Examination

The purpose of the Qualifying Examination is to judge the candidate’s potential as an independent researcher.

The Ph.D. qualifying exam consists of a focused literature review that will take place over the course of one semester.  At the beginning of the second semester of the student’s second year, a qualifying committee consisting of three members of the ML faculty will assign, in consultation with the student and the student’s advisor, a course of study consisting of influential papers, books, or other intellectual artifacts relevant to the student’s research interests.  The student’s focus area and current research efforts (and related portfolio) will be considered in defining the course of study.

At the end of the semester, the student will submit a written summary of each artifact that highlights the student’s understanding of the importance (and weaknesses) of the work in question and the relationship of this work to the student’s current research.  Subsequently, the student will have a closed oral exam with the three members of the committee.  The exam will be interactive, with the student and the committee discussing and criticizing each work and posing questions related to the student’s current research to determine the breadth of the student’s knowledge in that specific area.

The success of the examination will be determined by the committee’s qualitative assessment of the student’s understanding of the theory, methods, and ultimate impact of the assigned syllabus.

The student will be given a passing grade for meeting the requirements of the committee in both the written and the oral part. Unsatisfactory performance on either part will require the student to redo the entire qualifying exam in the following semester year. Each student will be allowed only two attempts at the exam.

Students are expected to complete the review by the end of their second year in the program.

Doctoral Dissertation

The primary requirement of the PhD student is to do original and substantial research.  This research is reported for review in the PhD dissertation, and presented at the final defense.

As the first step towards completing a dissertation, the student must prepare and defend a Research Proposal.  The proposal is a document of no more than 20 pages in length that carefully describes the topic of the dissertation, including references to prior work, and any preliminary results to date.  The written proposal is submitted to a committee of three faculty members from the ML PhD program, and is presented in a public seminar shortly thereafter.  The committee members provide feedback on the proposed research directions, comment on the strength of the student’s writing and oral presentation skills, and might suggest further courses to solidify the student’s background.  Approval of the Research Proposal by the committee is required at least six months prior to the scheduling of the PhD defense. It is expected that the student complete this proposal requirement no later than their fourth year in the program.

The PhD thesis committee consists of five faculty members: the student’s advisor, three additional members from the ML PhD program, and one faculty member external to the ML program.  The committee is charged with approving the written dissertation and administering the final defense.  The defense consists of a public seminar followed by an oral examination by the thesis committee.

Doctoral Minor (2 courses, 6 hours)

The minor follows the standard Georgia Tech requirement: 6 hours, preferably outside the student’s home unit, with a GPA in those graduate-level courses of at least 3.0.  The courses for the minor should form a cohesive program of study outside the area of Machine Learning; no ML core or elective courses may be used to fulfill this requirement.  The minor courses must be approved by the student’s thesis advisor and the ML Academic Advisor.  Typical programs will consist of two courses from the same school (any school at the Institute) or two courses from the same area of study.