RSS | Module Info | Add a review of HTML-TableParser
HTML-TableParser
(0.38)
Works well out of the box, extracting the text in table cells.
However, there's no way to extract additional HTML information from TD elements, for instance links or img src attributes. For that, use HTML::TableExtract.
Dan Dascalescu - 2012-04-16T16:13:16 (permalink)
Was this review helpful to you?
Yes
No
HTML-TableParser
(0.38)
- doesnt handle "flipped" tables (th's on rows not columns)
- MultiMatch should be the default
w george - 2009-05-15T09:15:30 (permalink)
Was this review helpful to you?
Yes
No
HTML-TableParser
(0.38)
About as close as possible to a perfect Do-What-I-Mean solution to an otherwise painful, complicated, process.
Anirvan Chatterjee - 2009-02-03T12:22:59 (permalink)
2 out of 2 found this review helpful.
Was this review helpful to you?
Yes
No
HTML-TableParser
(0.34)
this is a good module to retrive data in HTML table~~~
the following is a sample program to get weather forecast from
tw.weather.yahoo.com/tomorrow.html
and print the result to stdout.
use LWP::UserAgent;
use HTTP::Request::Common;
use HTML::TableParser;
#
# get web page tw.weather.yahoo.com/tomorrow.html
#
$ua = LWP::UserAgent->new;
$res = $ua->request(GET 'tw.weather.yahoo.com/tomorrow.html');
@content = split "\n", $res->as_string;
foreach (@content){
if(/^(民國.*)/){
print $_, "\n\n";
}
}
#
# reference
# search.cpan.org/~djerius/HTML-TablePa...
#
@reqs = (
{
id => 5.1, # id for embedded table
row => \&row, # function callback
}
);
# create parser object
$p = HTML::TableParser->new( \@reqs,
{ Decode => 0, Trim => 0, Chomp => 0 } );
$p->parse($res->as_string);
# function callbacks
sub row {
my ( $id, $line, $cols, $udata ) = @_;
print join "\t", @$cols;
print "\n";
}
macarthur - 2004-08-21T09:05:55 (permalink)
1 out of 1 found this review helpful.
Was this review helpful to you?
Yes
No

