(original) (raw)
��
<![if !vml]>
<![endif]><o:p>
Fall 2024: Machine Learning with Graphs CS 7332 (CRN 20227) & NETS 7332 (CRN 17372)
<o:p>
General Information <o:p>
Time: Tuesdays & Fridays 1:35 3:15 PM Eastern<o:p> | Place: 177 Huntington Ave, 2nd Floor Conference Room (#207)<o:p> |
Instructor: Tina Eliassi-Rad<o:p>Course website on Canvas: https://northeastern.instructure.com/courses/194307� <o:p> | Office hours: Available by appointment. Email eliassi [at] ccs [dot] neu [dot] edu |
<o:p>
Overview <o:p>
This 4-credit PhD-level course covers state-of-the-art research on mining and learning with graphs. Topics include, but are not limited to, vertex classification, graph clustering, link prediction and analysis, graph distances, graph embedding and network representation learning, deep learning on graphs, anomaly detection on graphs, graph summarization, network inference, adversarial learning on networks, and notions of fairness in social networks.
<o:p>
Prerequisites<o:p>
Students are expected to have taken courses on or have knowledge of the following:
<![if !supportLists]>o <![endif]>Calculus and linear algebra<o:p>
<![if !supportLists]>o <![endif]>Basic statistics, probability, machine learning, or data mining<o:p>
<![if !supportLists]>o <![endif]>Algorithms and programming skills (e.g., Python, Julia, C, C++, Java, Ruby, Matlab, or any programming language of their preference)<o:p>
<o:p>
Textbooks <o:p>
This course does not have a designated textbook. The readings are assigned in the syllabus (see below).
<o:p>
Here are some textbooks (all optional) on machine learning and data mining:
- Deep Learning and Graph Representation Learning
- Charu C. Aggarwal. Neural Networks and Deep Learning: A Textbook. Springer, 2018.
- Michael Bronstein, Joan Bruna, Taco Cohen and Petar Veli kovi. Geometric Deep Learning: Grids, Graphs, Groups, Geodesics, and Gauges. arXiv:2104.13478, April 2021.
- Ian Goodfellow, Yoshua Bengio, Aaron Courville. Deep Learning. MIT Press, 2016.
- William L. Hamilton. Graph Representation Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, Vol. 14, No. 3, Pages 1-159, 2020.
- Jure Leskovec. Machine Learning with Graphs Video Lectures, Stanford Online, 2021.
- Michael M. Bronstein, Joan Bruna, Taco Cohen, Petar Veli kovi. Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges, 2021.
- Xavier Bresson. Graph Machine Learning, 2022-23.
- Data Mining and Graph Mining
- Charu C. Aggarwal. Data Mining, The Textbook. Springer 2015.
- Christos Faloutsos, Deepayan Chakrabarti. Graph Mining: Laws, Tools, and Case Studies. Morgan & Claypool Publishers, 2012.
- Jiawei Han, Micheline Kamber, Jian Pei. Data Mining: Concepts and Techniques. Morgan Kaufmann, 3rd edition, 2011.
- David J. Hand, Heikki Mannila, Padhraic Smyth. Principles of Data Mining. A Bradford Book, 2001
- Anand Rajaraman, Jurij Leskovec, and Jeffrey Ullman. Mining of Massive Datasets. Cambridge University Press, v2.1, 2014. (free online) (Errata)
- Pang-Ning Tan, Michael Steinbach, Vipin Kumar. Introduction to Data Mining. Pearson, 2nd edition, 2018.
- Machine Learning
- Charu C. Aggarwal. Linear Algebra and Optimization for Machine Learning: A Textbook. Springer, 2020.
- Christopher Bishop. Pattern Recognition and Machine Learning. Springer, 2006.
- Peter Flach. Machine Learning: The Art and Science of Algorithms that Make Sense of Data. Cambridge University Press, 2012.
- Tom Mitchell. Machine Learning. McGraw Hill, 1997.
- Kevin P. Murphy. Machine Learning: A Probabilistic Perspective. MIT Press, 2012.
- Statistics
- Trevor Hastie, Robert Tibshirani, Jerome Friedman. Elements of Statistical Learning. Springer, 2nd edition, 2009. (free online)
<o:p>
Resources <o:p>
- Probability Review (David Blei, Princeton)
- Probability Theory Review (Arian Maleki and Tom Do, Stanford)
- Linear Algebra Review and Reference (Zico Kolter and Chuong Do, Stanford)
- Theoretical CS Cheat Sheet (Princeton)
- Encyclopedia of Distances (Michel Marie Deza and Elena Deza)
<o:p>
Grading<o:p>
<![if !supportLists]>o <![endif]>Class presentations (40%)<o:p>
<![if !supportLists]>o <![endif]>I will team up students into groups.<o:p>
<![if !supportLists]>o <![endif]>Each week will have an assigned team. That team is responsible for presenting the readings for that week. <o:p>
<![if !supportLists]>o <![endif]>Besides the readings, each paper is likely to have additional materials on the Web. Examples include supplemental materials, video, code, data, etc. These are helpful for presentations and class projects.<o:p>
<![if !supportLists]>o <![endif]>Your slides for the lecture presentations are due at 12:00 PM Eastern on the day assigned to your team.<o:p>
<![if !supportLists]>o <![endif]>Class project (50%)<o:p>
<![if !supportLists]>o <![endif]>Each team will choose (by Tuesday October 8, 2024 at 11:59 PM Eastern) one of the papers in the syllabus to replicate.
<![if !supportLists]>o <![endif]>In addition to replication, each team will propose extension(s) to the chosen paper and implement those extension(s).
<![if !supportLists]>o <![endif]>Each team will write a report (maximum 6 pages) detailing what was learned. Use the style files at https://paperswithcode.com/static/rc2020/ML-Reproducibility-Challenge-2020-Template.zip. <o:p>
<![if !supportLists]>o <![endif]>Reports are due on Tuesday December 10, 2024 at 11:59 PM Eastern.<o:p>
<![if !supportLists]>o <![endif]>Project Presentations (10%)<o:p>
<![if !supportLists]>o <![endif]>Each team will have ~15 minutes to present what they learned when they tried to reproduce their chosen paper, followed by 5 minutes of Q&A.<o:p>
<![if !supportLists]>o <![endif]>The slides for the presentations are due on Friday December 6, 2024 at 12:00 PM Eastern. <o:p>
<o:p>
Schedule/Syllabus (Subject to Change) <o:p>
Date<o:p> | Lecturer<o:p> | Readings<o:p> |
Fri Sep 6 | Tina Eliassi-Rad | Overview<o:p> <![if !supportLists]>� <![endif]>The Why, How, and When of Representations for Complex Systems<o:p> |
Tue Sep 10 | Tina Eliassi-Rad | Homophily<o:p> <![if !supportLists]>� <![endif]>Distribution of Node Characteristics in Complex Networks<o:p> <![if !supportLists]>� <![endif]>Combinatorial Characterizations and Impossibilities for Higher-order Homophily<o:p> <![if !supportLists]>� <![endif]>Higher-order Homophily on Simplicial Complexes<o:p> Axiomatic Approaches<o:p> <![if !supportLists]>� <![endif]>Measuring Tie Strength in Implicit Social Networks <o:p> |
Fri Sep 13 | Tina Eliassi-Rad | Network Comparison and Graph Distances<o:p> <![if !supportLists]>� <![endif]>A Guide to Selecting a Network Similarity Method <o:p> <![if !supportLists]>� <![endif]>Network Comparison and the Within-ensemble Graph Distance<o:p> <![if !supportLists]>� <![endif]>Non-backtracking Cycles: Length Spectrum Theory and Graph Mining Applications<o:p> <![if !supportLists]>� <![endif]>[reference] netrd: A library for Network Reconstruction and Graph Distances<o:p> |
Tue Sep 17 | Tina Eliassi-Rad | Role Discovery<o:p> <![if !supportLists]>� <![endif]>It's Who You Know: Graph Mining Using Recursive Structural Features<o:p> <![if !supportLists]>� <![endif]>RolX: Structural Role Extraction & Mining in Large Graphs<o:p> <![if !supportLists]>� <![endif]>Guided learning for Role Discovery (GLRD): Framework, Algorithms, and Applications<o:p> |
Fri Sep 20 | Tina Eliassi-Rad | Graph Representation Learning: Node Embedding<o:p> <![if !supportLists]>� <![endif]>Laplacian Eigenmaps for Dimensionality Reduction and Data Representation<o:p> <![if !supportLists]>� <![endif]>node2vec: Scalable Feature Learning for Networks<o:p> <![if !supportLists]>� <![endif]>Structural Deep Network Embedding<o:p> <![if !supportLists]>� <![endif]>STABLE: Identifying and Mitigating Instability in Embeddings of the Degenerate Core<o:p> <![if !supportLists]>� <![endif]>[reference] Machine Learning on Graphs: A Model and Comprehensive Taxonomy<o:p> <![if !supportLists]>� <![endif]>[reference] Network Representation Learning: From Preprocessing, Feature Extraction to Node Embedding<o:p> <![if !supportLists]>� <![endif]>[optional] Next Waves in Veridical Network Embedding <o:p> |
Tue Sep 24 | Team 1: Sharaj Kunjar, Tamanna Urmi, Xiyu Yang | Low-rank Representations of Complex Networks<o:p> <![if !supportLists]>� <![endif]>The Impossibility of Low-rank Representations for Triangle-Rich Complex Networks<o:p> <![if !supportLists]>� <![endif]>Node Embeddings and Exact Low-rank Representations of Complex Networks<o:p> <![if !supportLists]>� <![endif]>[optional] Classic Graph Structural Features Outperform Factorization-Based Graph Embedding Methods on Community Labeling<o:p> <![if !supportLists]>� <![endif]>[optional] Link Prediction Using Low-dimensional Node Embeddings: The Measurement Problem<o:p> |
Fri Sep 27 | Team 2: Amanuel Tesfaye, Narayan Sabhahit, Yiyuan Zhang | Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>Semi-Supervised Classification with Graph Convolutional Networks<o:p> <![if !supportLists]>� <![endif]>Graph Attention Networks<o:p> <![if !supportLists]>� <![endif]>Gated Graph Sequence Neural Networks<o:p> <![if !supportLists]>� <![endif]>[reference] Graph Neural Networks: A Review of Methods and Applications<o:p> <![if !supportLists]>� <![endif]>[reference] A Comprehensive Survey on Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>[optional] Everything is Connected: Graph Neural Networks<o:p> |
Tue Oct 1 | Team 3: Jesseba Fernando, Joshua Josh Rosen, Yicheng Andy Zhang | This and That<o:p> <![if !supportLists]>� <![endif]>Hyperbolic Graph Convolutional Neural Networks<o:p> <![if !supportLists]>� <![endif]>Principal Neighbourhood Aggregation for Graph Nets<o:p> <![if !supportLists]>� <![endif]>Pitfalls of Graph Neural Network Evaluation<o:p> <![if !supportLists]>� <![endif]>[optional] The Numerical Stability of Hyperbolic Representation Learning<o:p> |
Fri Oct 4 | Team 1: Sharaj Kunjar, Tamanna Urmi, Xiyu Yang | Collective Classification<o:p> <![if !supportLists]>� <![endif]>Collective Classification in Network Data<o:p> <![if !supportLists]>� <![endif]>Graph Belief Propagation Networks<o:p> <![if !supportLists]>� <![endif]>[optional] Cautious Collective Classification<o:p> |
Tue Oct 8 | Team 2: Amanuel Tesfaye, Narayan Sabhahit, Yiyuan Zhang | Label Propagation on Graphs<o:p> <![if !supportLists]>� <![endif]>Combining Label Propagation and Simple Models Out-performs Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification<o:p> <![if !supportLists]>� <![endif]>Design Space for Graph Neural Networks (GitHub page)<o:p> <![if !supportLists]>� <![endif]>[optional] Message passing all the way up<o:p> Class project proposals are due at 11:59 PM Eastern. |
Fri Oct 11 | Team 3: Jesseba Fernando, Joshua Josh Rosen, Yicheng Andy Zhang | Graph Transformers<o:p> <![if !supportLists]>� <![endif]>Graph Transformer Networks<o:p> <![if !supportLists]>� <![endif]>A Generalization of Transformer Networks to Graphs<o:p> <![if !supportLists]>� <![endif]>Transformers are Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>[reference] Graph Transformers: A Survey<o:p> |
Tue Oct 15 | Team 1: Sharaj Kunjar, Tamanna Urmi, Xiyu Yang | ML on Heterogeneous Graphs<o:p> <![if !supportLists]>� <![endif]>Modeling Relational Data with Graph Convolutional Networks<o:p> <![if !supportLists]>� <![endif]>Heterogeneous Graph Transformer<o:p> |
Fri Oct 18 | Team 2:Amanuel Tesfaye, Narayan Sabhahit, Yiyuan Zhang | GNNs for Recommendation Systems<o:p> <![if !supportLists]>� <![endif]>Neural Graph Collaborative Filtering<o:p> <![if !supportLists]>� <![endif]>Graph Convolutional Neural Networks for Web-Scale Recommender Systems<o:p> |
Tue Oct 22 | Team 3: Jesseba Fernando, Joshua Josh Rosen, Yicheng Andy Zhang | W-L Graph Kernels and Power of GNNs<o:p> <![if !supportLists]>� <![endif]>Weisfeiler-Lehman Graph Kernels<o:p> <![if !supportLists]>� <![endif]>How Powerful are Graph Neural Networks?<o:p> <![if !supportLists]>� <![endif]>[optional] A Reduction of a Graph to a Canonical Form and an Algebra arising during this Reduction<o:p> <![if !supportLists]>� <![endif]>[reference] Theory of Graph Neural Networks: Representation and Learning<o:p> |
Fri Oct 25 | Team 1: Sharaj Kunjar, Tamanna Urmi, Xiyu Yang | Stability and Counting in GNNs<o:p> <![if !supportLists]>� <![endif]>Tree Mover s Distance: Bridging Graph Metrics and Stability of Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>Can Graph Neural Networks Count Substructures?<o:p> |
Tue Oct 29 | Team 2: Amanuel Tesfaye, Narayan Sabhahit, Yiyuan Zhang | Invariance and Equivariance <![if !supportLists]>� <![endif]>E(n) Equivariant Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>Invariant and Equivariant Graph Networks<o:p> |
Fri Nov 1 | Team 3: Jesseba Fernando, Joshua Josh Rosen, Yicheng Andy Zhang | Hypergraphs and Higher-order Models<o:p> <![if !supportLists]>� <![endif]>Random Walks on Hypergraphs with Edge-Dependent Vertex Weights <o:p> <![if !supportLists]>� <![endif]>Hypergraph Neural Networks<o:p> <![if !supportLists]>� <![endif]>[reference] A Survey on Hypergraph Mining: Patterns, Tools, and Generators<o:p> <![if !supportLists]>� <![endif]>[optional] Simplicial Attention Networks<o:p> |
Tue Nov 5 | Team 1: Sharaj Kunjar, Tamanna Urmi, Xiyu Yang | Graph ML for Optimization<o:p> <![if !supportLists]>� <![endif]>Assigning Entities to Teams as a Hypergraph Discovery Problem<o:p> <![if !supportLists]>� <![endif]>Distributed constrained combinatorial optimization leveraging hypergraph neural networks<o:p> |
Fri Nov 8<o:p> | Guest Lecturer: | Cyber networks and ML<o:p> <![if !supportLists]>� <![endif]>Cyber Network Resilience against Self-Propagating Malware Attacks<o:p> <![if !supportLists]>� <![endif]>Modeling Self-Propagating Malware with Epidemiological Models<o:p> |
Tue Nov 12 | Team 2: Amanuel Tesfaye, Narayan Sabhahit, Yiyuan Zhang | Explainability in GNNs<o:p> <![if !supportLists]>� <![endif]>Explainability in Graph Neural Networks: A Taxonomic Survey<o:p> <![if !supportLists]>� <![endif]>GNNExplainer: Generating Explanations for Graph Neural Networks<o:p> |
Fri Nov 15 | Team 3: Jesseba Fernando, Joshua Josh Rosen, Yicheng Andy Zhang | Explainability and Trustworthiness of GNNs<o:p> <![if !supportLists]>� <![endif]>GraphFramEx: Towards Systematic Evaluation of Explainability Methods for Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>Trustworthy Graph Neural Networks: Aspects, Methods and Trends<o:p> |
Tue Nov 19 | Team 1: Sharaj Kunjar, Tamanna Urmi, Xiyu Yang | Fairness and Equality<o:p> <![if !supportLists]>� <![endif]>FAIRGEN: Towards Fair Graph Generation<o:p> <![if !supportLists]>� <![endif]>Information Access Equality on Generative Models of Complex Networks<o:p> |
Fri Nov 22 | Team 2: Amanuel Tesfaye, Narayan Sabhahit, Yiyuan Zhang | Oversmooting and Oversquashing<o:p> <![if !supportLists]>� <![endif]>A Survey on Oversmoothing in Graph Neural Networks<o:p> <![if !supportLists]>� <![endif]>How does over-squashing affect the power of GNNs?<o:p> <![if !supportLists]>� <![endif]>Neural Sheaf Diffusion: A Topological Perspective on Heterophily and Oversmoothing in GNNs<o:p> |
Tue Nov 26 | Team 3: Jesseba Fernando, Joshua Josh Rosen, Yicheng Andy Zhang | ICML 2024 Position Papers<o:p> <![if !supportLists]>� <![endif]>Future Directions in the Theory of Graph Machine Learning<o:p> <![if !supportLists]>� <![endif]>Why We Must Rethink Empirical Research in Machine Learning <o:p> |
Fri Nov 29 | No class Thanksgiving break | |
Tue Dec 3<o:p> | Guest lecturer: | ML for Computational Social Science<o:p> <![if !supportLists]>� <![endif]>Using Sequences of Life-events to Predict Human Lives |
Fri Dec 6 | In-class project presentations | Your slides are due at 12pm Eastern. |
Tue Dec 10 | No class Project reports are due on Tue Dec 10 at 11:59pm Eastern. | |
Fri Dec 13 | No class |
<o:p>