High dimensional data refers to datasets that have a large number of variables (features) compared to the number of observations. In the context of epidemiology, high dimensional data can include genetic information, electronic health records, and multi-omics data. These datasets often contain thousands to millions of variables, posing unique challenges and opportunities for data analysis.