Data Leakage

We all know data leakage is a serious problem while building ML models. I was wondering if there are any publicly available datasets which any of you came across which has a serious case of data leakage. It’d be great if you can share your experience. This is one example. I am searching for similar datasets.

Suggestions and advises are always welcome.