pkgsrc-Changes-HG archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
[pkgsrc/trunk]: pkgsrc/www/p5-HTML-Parser Update p5-HTML-Parser from version ...
details: https://anonhg.NetBSD.org/pkgsrc/rev/bcc319e53355
branches: trunk
changeset: 485278:bcc319e53355
user: he <he%pkgsrc.org@localhost>
date: Sun Dec 05 18:38:58 2004 +0000
description:
Update p5-HTML-Parser from version 3.35 to 3.42.
Change HOMEPAGE to author-independent link on search.cpan.org.
Change log:
2004-12-04 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.42
Avoid sv_catpvn_utf8_upgrade() as that macro was not
available in perl-5.8.0.
Patch by Reed Russell <Russell.Reed%acxiom.com@localhost>.
Add casts to suppress compilation warnings for char/U8
mismatches.
HTML::HeadParser will always push new header values.
This make sure we never loose old header values.
2004-11-30 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.41
Fix unresolved symbol error with perl-5.005.
2004-11-29 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.40
Make utf8_mode only available on perl-5.8 or better. It produced
garbage with older versions of perl.
Emit warning if entities are decoded and something in the first
chunk looks like hibit UTF-8. Previously this warning was only
triggered for documents with BOM.
2004-11-23 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.39_92
More documentation of the Unicode issues. Moved around HTML::Parser
documentation a bit.
New boolean option; $p->utf8_mode to allow parsing of raw UTF-8.
Documented that HTML::Entities::decode_entities() can take multiple
arguments.
Unterminated entities are now decoded in text (compatibility
with MSIE misfeature).
Document HTML::Entities::_decode_entities(); this variation of the
decode_entities() function has been available for a long time, but
have not been documented until now.
HTML::Entities::_decode_entities() can now be told to try to
expand unterminated entities.
Simplified Makefile.PL
2004-11-23 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.39_91
The HTML::HeadParser will skip Unicode BOM. Previously it
would consider the <head> section done when it saw the BOM.
The parser will look for Unicode BOM and give appropriate
warnings if the form found indicate trouble.
If no matching end tag is found for <script>, <style>, <xmp>
<title>, <textarea> then generate one where the next tag
starts.
For <script> and <style> recognize quoted strings and don't
consider end element if the corresponding end tag is found
inside such a string.
2004-11-17 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.39_90
The <title> element is now parsed in literal mode, which
means that other tags are not recognized until </title> has
been seen.
Unicode support for perl-5.8 and better.
Decoding Unicode entities always enabled; no longer a compile
time option.
Propagation of UTF8 state on strings.
Patch contributed by John Gardiner Myers <jgmyers%proofpoint.com@localhost>.
Calculate offsets and lengths in chars for Unicode strings.
Fixed link typo in the HTML::TokeParser documentation.
2004-11-11 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.38
New boolean option; $p->closing_plaintext
Contributed by Alex Kapranoff <alex%kapranoff.ru@localhost>
2004-11-10 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.37
Improved handling of HTML encoded surrogate pairs and illegally
endoded Unicode; <http://rt.cpan.org/Ticket/Display.html?id=7785>.
Patch by John Gardiner Myers <jgmyers%proofpoint.com@localhost>.
Avoid generating bad UTF8 strings when decoding entities
representing chars beyond #255 in 8-bit strings. Such bad
UTF8 sometimes made perl-5.8.5 and older segfault.
Undocument v2 style subclassing in synopsis section.
Internal cleanup:
Make 'gcc -Wall' happier.
Avoid modification of PVs during parsing of attrspec.
Another patch by John Gardiner Myers.
2004-04-01 Gisle Aas <gisle%ActiveState.com@localhost>
Release 3.36
Improved MSIE/Mozilla compatibility. If the same attribute
name repeats for a start tag, use the first value instead
of the last. Patch by Nick Duffek <html-parser%duffek.com@localhost>.
<https://rt.cpan.org/Ticket/Display.html?id=5472>
diffstat:
www/p5-HTML-Parser/Makefile | 6 +++---
www/p5-HTML-Parser/distinfo | 7 +++----
www/p5-HTML-Parser/patches/patch-aa | 13 -------------
3 files changed, 6 insertions(+), 20 deletions(-)
diffs (50 lines):
diff -r 8236a5b56313 -r bcc319e53355 www/p5-HTML-Parser/Makefile
--- a/www/p5-HTML-Parser/Makefile Sun Dec 05 18:29:01 2004 +0000
+++ b/www/p5-HTML-Parser/Makefile Sun Dec 05 18:38:58 2004 +0000
@@ -1,14 +1,14 @@
-# $NetBSD: Makefile,v 1.26 2004/04/24 22:46:13 jlam Exp $
+# $NetBSD: Makefile,v 1.27 2004/12/05 18:38:58 he Exp $
#
-DISTNAME= HTML-Parser-3.35
+DISTNAME= HTML-Parser-3.42
PKGNAME= p5-${DISTNAME}
SVR4_PKGNAME= p5hpa
CATEGORIES= www perl5
MASTER_SITES= ${MASTER_SITE_PERL_CPAN:=HTML/}
MAINTAINER= tech-pkg%NetBSD.org@localhost
-HOMEPAGE= http://search.cpan.org/author/GAAS/HTML-Parser/
+HOMEPAGE= http://search.cpan.org/dist/HTML-Parser/
COMMENT= Perl5 module to parse HTML text documents
DEPENDS+= p5-HTML-Tagset>=3.0:../../www/p5-HTML-Tagset
diff -r 8236a5b56313 -r bcc319e53355 www/p5-HTML-Parser/distinfo
--- a/www/p5-HTML-Parser/distinfo Sun Dec 05 18:29:01 2004 +0000
+++ b/www/p5-HTML-Parser/distinfo Sun Dec 05 18:38:58 2004 +0000
@@ -1,5 +1,4 @@
-$NetBSD: distinfo,v 1.8 2004/01/20 13:41:19 abs Exp $
+$NetBSD: distinfo,v 1.9 2004/12/05 18:38:58 he Exp $
-SHA1 (HTML-Parser-3.35.tar.gz) = 2bdcb02bbd80d49d77d9ed086ff2659043758473
-Size (HTML-Parser-3.35.tar.gz) = 75266 bytes
-SHA1 (patch-aa) = 2db44b7ffb783264f0fd2db79449d1408745bcee
+SHA1 (HTML-Parser-3.42.tar.gz) = 2d540a74cbc6375c00e69986e6628b1da0e4a40b
+Size (HTML-Parser-3.42.tar.gz) = 81771 bytes
diff -r 8236a5b56313 -r bcc319e53355 www/p5-HTML-Parser/patches/patch-aa
--- a/www/p5-HTML-Parser/patches/patch-aa Sun Dec 05 18:29:01 2004 +0000
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
@@ -1,13 +0,0 @@
-$NetBSD: patch-aa,v 1.2 2003/08/25 00:00:09 heinz Exp $
-
---- Makefile.PL.orig Fri Aug 15 17:32:56 2003
-+++ Makefile.PL
-@@ -19,7 +19,7 @@ the question below such entities will be
- in the Latin-1 range is decoded.
-
- EOT
-- my $ans = prompt("Do you want decoding on unicode entities?", "no");
-+ my $ans = "y"; #prompt("Do you want decoding on unicode entities?", "no");
- if ($ans =~ /^y(es)?$/i) {
- push(@define, "-DUNICODE_ENTITIES");
- }
Home |
Main Index |
Thread Index |
Old Index