
https://github.com/cloudflare/lol-html is an HTML rewriter that explicitly does not construct a DOM or an AST, but streams nodes through ASAP.

https://blog.cloudflare.com/html-parsing-1/

https://blog.cloudflare.com/html-parsing-2/
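
To make the "stream nodes through without a DOM" idea concrete, here is a rough sketch in Python. It is not lol-html's API and not how lol-html is implemented; it uses the standard library's `html.parser.HTMLParser`, which also hands you tokens as input arrives. The names `LinkRewriter` and `rewrite_stream` are made up for this illustration, and it assumes reasonably well-formed input.

```python
import html
from html.parser import HTMLParser


class LinkRewriter(HTMLParser):
    """Rewrites href on <a> tags as tokens stream past; no tree is ever built."""

    def __init__(self, rewrite):
        # convert_charrefs=False so entities are passed through instead of decoded
        super().__init__(convert_charrefs=False)
        self.rewrite = rewrite  # hypothetical callback: old URL -> new URL
        self.pieces = []        # output produced since the last flush

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            # Untouched tags are copied through exactly as they appeared.
            self.pieces.append(self.get_starttag_text())
            return
        parts = [tag]
        for name, value in attrs:
            if value is None:
                parts.append(name)  # attribute without a value
                continue
            if name == "href":
                value = self.rewrite(value)
            # The parser unescapes attribute values, so escape them again.
            parts.append(f'{name}="{html.escape(value, quote=True)}"')
        self.pieces.append("<" + " ".join(parts) + ">")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are passed through unchanged.
        self.pieces.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.pieces.append(f"</{tag}>")

    def handle_data(self, data):
        self.pieces.append(data)

    def handle_entityref(self, name):
        self.pieces.append(f"&{name};")

    def handle_charref(self, name):
        self.pieces.append(f"&#{name};")

    def handle_comment(self, data):
        self.pieces.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.pieces.append(f"<!{decl}>")


def rewrite_stream(chunks, rewrite):
    """Yield rewritten output after each input chunk, without buffering the document."""
    parser = LinkRewriter(rewrite)
    for chunk in chunks:
        parser.feed(chunk)  # a tag split across chunks is held until it is complete
        yield "".join(parser.pieces)
        parser.pieces.clear()
    parser.close()          # flush anything still pending at end of input
    yield "".join(parser.pieces)
```

You would call it as `"".join(rewrite_stream(chunks, lambda url: "https://example.com" + url))`, or write each yielded piece straight to a socket. The point is only that memory use is bounded by the largest pending token rather than by the size of the page.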



This is the best approach if they don't need to consider client-side JS mutations. Not sure why I interpreted it that way in my original reply. Thanks!



