CAP 5636 - Advanced Artificial Intelligence

Fall 2025

Course description:	Principles of artificial intelligence. Uninformed and informed search. Constraint satisfaction. AI for game playing. Probabilistic reasoning, Markov decision processes, hidden Markov models, Bayes nets. Neural networks and deep learning.
Student learning outcomes:	By the end of the semester the students will be able to: understand the search and decision making techniques used in modern artificial intelligence apply artificial intelligence techniques in their own code understand the societal and ethical implications of artificial intelligence
Instructor:	Dr. Lotzi Bölöni
Office Location:	HEC - 319
E-mail:	Ladislau.Boloni@ucf.edu (preferred means of communication)
Team:	TA: Anthony Bilic anthony.bilic@ucf.edu Grader: Rafael Montalvo rafael.montalvo2@ucf.edu
Web Site:	http://www.cs.ucf.edu/~lboloni/Teaching/CAP5636_Fall2025/index.html The assignments and the other announcements will be posted on the course web site
Classroom:	CB-0307
Class hours:	Tue, Th 12:00pm - 1:15pm
Office hours:	Tue, Th 1:30pm - 3:00pm (in HEC 319)
Enrollment requirements:	CAP 4630, or consent of instructor.
Required texts:	There is no required textbook.
Recommended readings:	Stuart Russel and Peter Norvig, Artificial Intelligence - A Modern Approach, 4th edition
Verification of engagement:	As of Fall 2014, all faculty members are required to document students' academic activity at the beginning of each course. In order to document that you began this course, please complete the following academic activity by the end of the first week of classes, or as soon as possible after adding the course. Failure to do so will result in a delay in the disbursement of your financial aid. To satisfy this requirement, you must finish the first quiz posted online. Log in to Webcourses, choose CAP 5636, and submit your answers online.

Syllabus

Date	Topic	Lecture Notes, Readings, Homeworks
Tue, Aug. 19	Introduction and content of the class The agent view of AI. Topics covered by the AI and ML classes. History and positioning of AI Motivating AI. Dangers of AI and AGI. Early history Expert systems	[slides] Content of the class [slides] History and positioning of AI [homework] HW1: AI History and Future - Due Aug. 24, 2025 [reading] Perceptrons (New York Times, July 13, 1958) [reading] Nick Bostrom - How long before superintelligence? (1998)
Thu, Aug. 21	History and positioning of AI Neural networks The two intellectual traditions: logic vs neural networks A melting pot of other ideas The agent view of AI	[homework] HW2: Intro to deep learning - Due Sept. 4, 2025
Tue, Aug. 26	Probability Random variables Joint and marginal distributions, conditional distribution	[slides] Probabilities - Introduction
Thu, Aug. 28	Independent variables and Bayes' nets Product rule, chain rule, Bayes' rule Inference Independence Independent random variables Conditional independence Bayes' nets	[slides] Independence
Tue, Sept. 2	Planning with uninformed search Reflex agents Search problems	[slides] Uninformed search
Thu, Sep. 4	Planning with uninformed search (cont'd) Depth first and breadth first search Uniform cost search
Tue, Sep. 9	*Planning with informed search: A search and heuristics** Informed search methods Heuristics Greedy search A* search Graph search	[slides] Informed search
Thu, Sep. 11	Game playing and adversarial search Types of games Adversarial search, minimax Alpha Beta pruning	[homework] HW3 - Planning with a goat - Due September 25th, 2025 [slides] Adversarial search
Tue, Sep. 16	*Midterm 1 - Introduction to A (not including game play)**
Thu, Sept. 18	Game playing and adversarial search (cont'd) The problem of depth Evaluation functions
Tue, Sep. 23	Expectimax search Expectimax search	[slides] Expectimax search
Thu, Sept. 25	State of the art in game play Monte Carlo tree search AlphaGo AlphaGo Zero Real life games	[slides] Game play state of the art
Tue, Sept 30	Utilities and rationality Utilities Axioms of rationality Rewards	[slides] Utilities and rationality
Thu, Oct. 2	Markov decision processes Defining MDPs: policies and utilities Optimal policy, value of state, value of Q-state	[slides] Markov Decision Processes
Tue, Oct. 7	Markov decision processes 2 Value iteration	[homework] HW4 - MDP and Q-Learning - Due October 24
Thu, Oct. 9	Markov decision processes 3 Policy extraction Policy iteration
Tue, Oct. 14	Reinforcement learning Reinforcement learning as a twist on MDPs Model-based and model-free learning Temporal difference learning	[slides] Reinforcement learning
Thu, Oct 16	Reinforcement learning (cont'd) Exploration vs. exploitation, regret Generalization across states	[homework] HW5 - DQN - Due November 6th
Tue, Oct. 21	Midterm 2: from Game Play to MDPs (not including Reinforcement Learning)
Thu, Oct. 23	Deep reinforcement learning Challenges with Q estimation The "deadly triad" DQN Double Q-learning
Tue, Oct. 28	Policy gradient RL Policy gradient theorem Using baselines	[slides] Policy gradient RL
Thu, Oct. 30	Reinforcement learning state of the art Soft actor-critic (SAC) Proximal policy optimization (PPO) Application: reinforcement learning from human feedback (RLHF)	[slides] RL State of the Art
Tue, Nov. 4	Imitation learning Behavior cloning Inverse reinforcement learning	[slides] Imitation learning
Thu, Nov. 6	Imitation learning (cont'd)
Tue, Nov. 11	Veteran's day, no class
Thu, Nov. 13	Hidden Markov models Hidden Markov models Example: robot localization	[slides] Hidden Markov Models
Tue, Nov. 18	Hidden Markov models (cont'd)	[homework] HW6 - Societal implications - Due November 28
Thu, Nov. 20	Artificial General Intelligence - Definitions and Tests Intelligence tests (Turing, Loebner prize, Winograd schema) Artificial Superintelligence Alignment problems LLMs	[slides] AGI definitions and tests
Tue, Nov. 25	Societal implications of AI Economic impact Societal changes Existential threat Ethical issues	[slides] Societal implications of AI
Thu, Dec. 4	Final exam: Thursday December 4, 2024, 10:00am - 12:50pm