Summary
HtmlCleaner is an open source HTML parser written in Java. HTML found on the Web is usually dirty, ill-formed and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring some order to the tags, attributes and ordinary text. For any given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows similar rules that the most of web browsers use in order to create the Document Object Model. However, you can provide custom tag and rule sets for tag filtering and balancing.
Versions
v2.29-r1 :: 0 :: gentoo
- Modified
- License
- BSD
- Keywords
- amd64
- USE flags
- doc source test
USE flags
General
Dependencies
app-arch / zip : Info ZIP (encryption support)
dev-java / ant : Java-based build tool similar to 'make' that uses XML configuration files
dev-java / java-config : Java environment configuration query tool
dev-java / jdom : Java-based solution for accessing, manipulating, and outputting XML data
dev-java / junit : Simple framework to write repeatable tests
Runtime Dependencies
app-arch / zip : Info ZIP (encryption support)
dev-java / java-config : Java environment configuration query tool
dev-java / jdom : Java-based solution for accessing, manipulating, and outputting XML data
Change logs
- Repository mirror & CI · gentoo
Merge updates from master - Arthur Zamarin · gentoo
dev-java/htmlcleaner: dekeyword x86
After a message to gentoo-dev ML, and no reply from any user on x86, we decided to dekeyword most Java packages from x86, and reverse dependencies of them. Signed-off-by: Arthur Zamarin <arthurzam@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Lucio Sauer · gentoo
*/*: inline mirror://sourceforge
bump copyright of touched ebuilds to 2024 Signed-off-by: Lucio Sauer <watermanpaint@posteo.net> Signed-off-by: Michał Górny <mgorny@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: switch to dev-java/ant
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/35723/commits/3d2a3e632890240c092f67ad33077c6159b4a9ae Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Repository mirror & CI · gentoo
Merge updates from master - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: drop 2.26
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/34473 Signed-off-by: Arthur Zamarin <arthurzam@gentoo.org> - Arthur Zamarin · gentoo
dev-java/htmlcleaner: Stabilize 2.29 x86, #920688
Signed-off-by: Arthur Zamarin <arthurzam@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Sam James · gentoo
dev-java/htmlcleaner: Stabilize 2.29 amd64, #920688
Signed-off-by: Sam James <sam@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: add 2.29 - CVE-2023-34624
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/33883 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: drop 2.25
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/27108 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Jakov Smolić · gentoo
dev-java/htmlcleaner: Stabilize 2.26 x86, #867805
Signed-off-by: Jakov Smolić <jsmolic@gentoo.org> - Jakov Smolić · gentoo
dev-java/htmlcleaner: Stabilize 2.26 amd64, #867805
Signed-off-by: Jakov Smolić <jsmolic@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: add 2.26
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/26697 Signed-off-by: Florian Schmaus <flow@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Agostino Sarubbo · gentoo
dev-java/htmlcleaner: x86 stable wrt bug #816375
Package-Manager: Portage-3.0.20, Repoman-3.0.3 RepoMan-Options: --include-arches="x86" Signed-off-by: Agostino Sarubbo <ago@gentoo.org> - Agostino Sarubbo · gentoo
dev-java/htmlcleaner: amd64 stable wrt bug #816375
Package-Manager: Portage-3.0.20, Repoman-3.0.3 RepoMan-Options: --include-arches="amd64" Signed-off-by: Agostino Sarubbo <ago@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Miroslav Šulc · gentoo
dev-java/htmlcleaner: removed obsolete 2.24
Package-Manager: Portage-3.0.28, Repoman-3.0.3 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Miroslav Šulc · gentoo
dev-java/htmlcleaner: bump to 2.25
Package-Manager: Portage-3.0.26, Repoman-3.0.3 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Miroslav Šulc · gentoo
dev-java/htmcleaner: fixed tests
Package-Manager: Portage-3.0.18, Repoman-3.0.3 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: patched
Package-Manager: Portage-3.0.18, Repoman-3.0.2 Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/20560 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: remove old
Package-Manager: Portage-3.0.18, Repoman-3.0.2 Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: bump to 2.24
Package-Manager: Portage-3.0.18, Repoman-3.0.2 Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org> - Repository mirror & CI · gentoo
Merge updates from master - Michał Górny · gentoo
*/*: [QA] Fix trivial cases of MissingTestRestrict
The result was achieved via the following pipeline: pkgcheck scan -c RestrictTestCheck -R FormatReporter \ --format '{category}/{package}/{package}-{version}.ebuild' | xargs -n32 grep -L RESTRICT | xargs -n32 sed -i -e '/^IUSE=.*test/aRESTRICT="!test? ( test )"' The resulting metadata was compared before and after the change. Few Go ebuilds had to be fixed manually due to implicit RESTRICT=strip added by the eclass. Two ebuilds have to be fixed because of multiline IUSE. Suggested-by: Robin H. Johnson <robbat2@gentoo.org> Closes: https://github.com/gentoo/gentoo/pull/13942 Signed-off-by: Michał Górny <mgorny@gentoo.org> - Robin H. Johnson · gentoo
Drop $Id$ per council decision in bug #611234.
Signed-off-by: Robin H. Johnson <robbat2@gentoo.org> - Patrice Clement · gentoo
dev-java/htmlcleaner: Stable for amd64. Retroactively mark stable for the remaining arches using the ALLARCHES policy.
Package-Manager: portage-2.2.26 - James Le Cuirot · gentoo
dev-java/htmlcleaner: JAVA_CLASSPATH_EXTRA has been renamed
Package-Manager: portage-2.2.20.1 - James Le Cuirot · gentoo
dev-java/htmlcleaner: Imported and bumped from java-overlay
Closes bug #369977. Package-Manager: portage-2.2.20.1