I needed to strip tags from a lot of small HTML documents, leaving only a small subset of tags behind. After experimenting with a few CPAN modules, I found that HTML::PullParser is small and fast enough to do the job. The interface is small and clean, and it works very well. I like it and will use it in the future.
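For readers who want to picture the approach, here is a minimal sketch of whitelist-based tag stripping with HTML::PullParser. The `strip_tags` helper and the `%keep` list are hypothetical names chosen for illustration (the review does not say which tags were kept), so treat this as one possible shape under those assumptions, not the reviewer's actual code.

```perl
use strict;
use warnings;
use HTML::PullParser;

# Hypothetical whitelist: the review does not name the subset of tags it kept.
my %keep = map { $_ => 1 } qw(p b i a);

# Return $html with every tag removed except those listed in %keep.
# Text between tags is always passed through unchanged.
sub strip_tags {
    my ($html) = @_;

    my $parser = HTML::PullParser->new(
        doc   => \$html,                   # parse from an in-memory string
        start => '"S", tagname, text',     # start tags: marker, name, raw source
        end   => '"E", tagname, text',     # end tags:   marker, name, raw source
        text  => '"T", text',              # plain text: marker, raw source
    );

    my $out = '';
    while (my $token = $parser->get_token) {
        my ($type, @rest) = @$token;
        if ($type eq 'T') {
            $out .= $rest[0];
        }
        else {
            my ($tagname, $text) = @rest;
            # Keep the original tag source only for whitelisted tags.
            $out .= $text if $keep{$tagname};
        }
    }
    return $out;
}

print strip_tags('<div><p>Hello <span class="x">big</span> <b>world</b></p></div>'), "\n";
```

Comments, declarations and processing instructions are dropped here simply because no handler is requested for them. Note that the contents of elements such as script or style would still arrive as text, so real input may need extra handling.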
I needed to download images from a web page, and used HTML::LinkExtor (part of HTML-Parser) to extract the image src links, based on an example in the module's POD.
It took me more time to download the images (there were lots of them on the page) than to write the code!
(A couple of things aren't clear in the documentation, such as whether the attribute hash passed to the callback is case-sensitive, or whether only link attributes are extracted. Fortunately this was not an issue for my immediate needs, but it should be explicitly documented.)
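As an illustration of the image-link extraction described above, a minimal sketch with HTML::LinkExtor might look like the following. The `image_links` helper is a hypothetical name, the code assumes the callback receives lowercase tag and attribute names, and the download step is left out entirely.

```perl
use strict;
use warnings;
use HTML::LinkExtor;

# Collect the src attribute of every <img> tag found in $html.
# Assumption: tag and attribute names arrive in lowercase.
sub image_links {
    my ($html) = @_;
    my @images;

    # The callback receives the tag name followed by attribute => value pairs.
    my $parser = HTML::LinkExtor->new(sub {
        my ($tag, %attr) = @_;
        return unless $tag eq 'img';
        push @images, $attr{src} if defined $attr{src};
    });

    $parser->parse($html);
    $parser->eof;    # signal end of document so buffered input is flushed
    return @images;
}

my @found = image_links('<a href="/x"><img src="a.png"></a><img src="b.jpg" alt="B">');
print "$_\n" for @found;
```

No base URL is passed to the constructor here, so the links come back exactly as written in the page. Passing a base as the second argument to `new` would yield absolute URLs instead.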