Page 1 of 1

HTML parsing

Posted: Mon Jan 08, 2018 10:49 am
by PioPio
Hello,

I am trying to extract some information from HTML pages. Looking into the DOMVisitor example I can see the method GetElementById can isolate a part of HTML page containing the information I need.
However, the HTML isolated as such, needs to be parsed anyway because there are nested tables, tags and so on in it and I was wondering which of the following is the best way to achieve this:
  • Using an HTML Parser. If so, I didn't find anything in CEF4Delphi (I suppose it is because this not the purpose of CEF4Delphi). Is there anything Open Source you would recommend for HTML parsing ?
Many thanks
Alberto

Re: HTML parsing

Posted: Mon Jan 08, 2018 11:09 am
by salvadordf