Suppose I scan the following: <b>Pro</b>gressives vs <b>con</b>servatives <a href="v">here</a>. The resultant clean html has undesirable newlines after the close tags. This is especially a problem for mostly-plain-text input where newlines get translated into line breaks.