Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
yencabulator
on Dec 3, 2022
|
parent
|
context
|
favorite
| on:
Pup: Parsing HTML at the command line
https://github.com/cloudflare/lol-html
is a HTML rewriter that explicitly does not construct DOM or an AST, but streams nodes throughs ASAP.
https://blog.cloudflare.com/html-parsing-1/
https://blog.cloudflare.com/html-parsing-2/
mh-
on Dec 3, 2022
[–]
This is the best approach if they don't need to consider client-side JS mutations. Not sure why I interpreted it that way in my original reply. Thanks!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
https://blog.cloudflare.com/html-parsing-1/
https://blog.cloudflare.com/html-parsing-2/