On this page I have a number of items to get the interested reader started with persistent homology and topological data analysis. If you know linear algebra you are ready to start!
Introduction to Topological Data Analysis and Persistent Homology
- Introduction to Persistent Homology, a great YouTube video, by Matthew Wright
- Studying the Shape of Data Using Topology, a brief non-technical introduction by Michael Lesnick
- A User’s Guide to Topological Data Analysis, by Elizabeth Munch.
- To help learn the basics, complete my worksheet.
The main technical tool for persistent homology is simplicial homology. For persistent homology, we use coefficients in a field. So simplicial k-chains are vectors and the set of simplicial k-chains is a vector space. Furthermore, the boundary map is a linear transformation. For finite simplices, it is represented by a matrix.
Topological Data Analysis with R
If you want to get started doing topological data analysis.
- My Introduction to TDA with R script: intro_tda.R. Copy and paste or download the file and rename it intro_tda.R.
- My older TDA with R workshop: instructions and R files tda_functions.R, tda_workshop_script.R, and persistence_script.R. Rename the files from *.txt to *.R.
- Use Jose Bouza’s tda-tools R package.
Topological Data Analysis and Persistent Homology
Here are some recent introductory articles. If you want to learn more about the subject I would recommend starting here. The first two are mathematical, the third emphasizes connections to data science, and the fourth is more statistical.
- A Brief History of Persistence, by Jose Perea
- Homological Algebra and Data, by Robert Ghrist
- An introduction to Topological Data Analysis: fundamental and practical aspects for data scientists, by Frédéric Chazal and Bertrand Michel
- Topological Data Analysis, by Larry Wasserman
There is a Wikipedia page.
The following slightly older introductory articles provide background, some mathematical details and a few applications.
- Persistent Homology – a Survey, by Herbert Edelsbrunner and John Harer
- Barcodes: The persistent topology of data, by Robert Ghrist
- Topology and data, by Gunnar Carlsson
The following are more technical summaries of some of the main results in the field.
- Persistent Homology, by Herbert Edelsbrunner and Dmitriy Morozov
- High Dimensional Topological Data Analysis, by Frederic Chazal
For a serious introduction, I highly recommend the following new book. It is an excellent resource for mathematics graduate students wanting to learn the subject.
- Persistence Theory: From Quiver Representations to Data Analysis, by Steve Oudot.