Disclosure Statement: This site contains affiliate links, which means that I may receive a commission if you make a purchase using these links. As an eBay Partner, I earn from qualifying purchases.
If you find these projects useful please consider becoming a sponsor with Patreon, GitHub or Liberapay.

HTML parsing

Post Reply
PioPio
Posts: 42
Joined: Sun Nov 05, 2017 10:25 pm

HTML parsing

Post by PioPio »

Hello,

I am trying to extract some information from HTML pages. Looking into the DOMVisitor example I can see the method GetElementById can isolate a part of HTML page containing the information I need.
However, the HTML isolated as such, needs to be parsed anyway because there are nested tables, tags and so on in it and I was wondering which of the following is the best way to achieve this:
  • Using an HTML Parser. If so, I didn't find anything in CEF4Delphi (I suppose it is because this not the purpose of CEF4Delphi). Is there anything Open Source you would recommend for HTML parsing ?
Many thanks
Alberto
Post Reply