Extraction of tabular data in HTML file using AI / ML or Deep learning

Hello friends,

I am doing a project in which I need to extract the tabular information from the HTML file using AI or Deep learning algorithm for around 1000 companies.

Problem statement

  1. The tables are located at any place in the 50-60 page HTML file.
  2. How to locate the specific table?
  3. Each table has a different dimension (no. of columns varies).
    Example -
    https://www.sec.gov/Archives/edgar/data/789019/000119312519268531/d791036ddef14a.htm#toc791036_17

Page no. 43 - Summary Compensation table
Page no. 45 - Grant of Plan-Based Awards table.
likewise rest of the table.

I am thinking of using cognitive service. Will it work for the above problem statement? Please help to solve this problem.
What Microsoft AI service it will require? And step to follow for the same.
Thank you!
A