[Owasp-antisamy] pluggable HTML parser

Arshan Dabirsiaghi arshan.dabirsiaghi at aspectsecurity.com
Thu Oct 30 12:25:55 EDT 2008


If you provide this patch it will most likely be integrated, although  
Tidy and others suffer from the issue of always providing a complete  
XHTML document.

Arshan


On Oct 30, 2008, at 11:12 AM, "Chase Seibert" <chase.seibert+antisamy at gmail.com 
 > wrote:

> Would it be possible for AntiSamy to support HTML parsers besides  
> nekoHTML? I don't currently have any problems with nekoHTML, but if  
> another parser is faster or more tolerant of malformed HTML, it  
> would be nice to have a choice.
>
> It seems that popular parsers (nekoHTML, jTidy and TagSoup) don't  
> implement a common interface. However, they do all have methods that  
> return a standard org.w3c.dom Document or DocumentFragment. Perhaps  
> an interface could be defined on the AntiSamy side for what it needs  
> from a parser? Ideally, an implementation of that interface could  
> then be provided by class name in a properties file.
>
> I would be willing to code this up, I just wanted to get opinions  
> first.
>
>   - Chase
>
>
> _______________________________________________
> Owasp-antisamy mailing list
> Owasp-antisamy at lists.owasp.org
> https://lists.owasp.org/mailman/listinfo/owasp-antisamy


More information about the Owasp-antisamy mailing list