Summary
HtmlCleaner is an open-source HTML parser written in Java. HTML found on the Web is usually dirty, ill-formed, and unsuitable for further processing. Any serious consumption of such documents first requires cleaning up the mess and bringing some order to the tags, attributes, and ordinary text. For any given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows rules similar to those most web browsers use to create the Document Object Model. However, you can provide custom tag and rule sets for tag filtering and balancing.
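To make "tag balancing" concrete, here is a minimal sketch in plain Java of how unclosed and stray tags can be repaired with a stack. It is a toy illustration only and is not HtmlCleaner's algorithm or API: the class name `TagBalancerSketch`, the method `balance`, and the short list of void elements are all assumptions made for this example. It does not escape text, quote attributes, or handle a `>` inside an attribute value, so its output is balanced but not guaranteed to be well-formed XML.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical toy class; not part of HtmlCleaner.
public class TagBalancerSketch {
    // A few elements that never take a closing tag (deliberately incomplete).
    private static final Set<String> VOID =
            new HashSet<>(Arrays.asList("br", "hr", "img", "input", "meta", "link"));

    // Group 1: "/" for a closing tag; group 2: tag name; group 3: raw attribute text.
    private static final Pattern TAG =
            Pattern.compile("<(/?)([A-Za-z][A-Za-z0-9]*)([^>]*)>");

    public static String balance(String html) {
        StringBuilder out = new StringBuilder();
        Deque<String> open = new ArrayDeque<>(); // currently open elements, innermost first
        Matcher m = TAG.matcher(html);
        int pos = 0;
        while (m.find()) {
            out.append(html, pos, m.start()); // copy text between tags unchanged
            pos = m.end();
            String name = m.group(2).toLowerCase(Locale.ROOT);
            String attrs = m.group(3);
            if (m.group(1).isEmpty()) {
                // Opening tag: void or self-closing elements are emitted as <name ... />.
                boolean selfClosing = attrs.endsWith("/");
                if (selfClosing || VOID.contains(name)) {
                    String a = selfClosing ? attrs.substring(0, attrs.length() - 1) : attrs;
                    out.append('<').append(name)
                       .append(a.replaceAll("\\s+$", "")).append(" />");
                } else {
                    out.append('<').append(name).append(attrs).append('>');
                    open.push(name);
                }
            } else if (open.contains(name)) {
                // Closing tag: first close everything opened after `name`, then `name`.
                String top;
                do {
                    top = open.pop();
                    out.append("</").append(top).append('>');
                } while (!top.equals(name));
            }
            // A closing tag with no matching opener is silently dropped.
        }
        out.append(html.substring(pos));
        // Close whatever is still open at the end of the input.
        while (!open.isEmpty()) {
            out.append("</").append(open.pop()).append('>');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(balance("<P>Hello <b>world<br></i>"));
    }
}
```

The stack keeps the innermost open element on top, so a closing tag that skips over inner elements closes them first, and anything left open at the end of the input is closed in reverse order. A real cleaner also needs per-tag rules (which elements may nest, which close implicitly), which is the part the summary says can be customized.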
Versions
v2.29 :: 0 :: gentoo
- Modified
- License: BSD
- Keywords: ~amd64 ~x86
- USE flags: doc source test
v2.26 :: 0 :: gentoo
- Modified
- License: BSD
- Keywords: amd64 x86
- USE flags: doc source test
USE flags
General
Dependencies
app-arch / zip : Info ZIP (encryption support)
dev-java / ant-core : Java-based build tool similar to 'make' that uses XML configuration files
dev-java / java-config : Java environment configuration query tool
dev-java / jdom : Java-based solution for accessing, manipulating, and outputting XML data
dev-java / junit : Simple framework to write repeatable tests
Runtime Dependencies
app-arch / zip : Info ZIP (encryption support)
dev-java / java-config : Java environment configuration query tool
dev-java / jdom : Java-based solution for accessing, manipulating, and outputting XML data
Change logs
- Repository mirror & CI · gentoo
Merge updates from master
- Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: add 2.29 - CVE-2023-34624
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net>
Closes: https://github.com/gentoo/gentoo/pull/33883
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: drop 2.25
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net>
Closes: https://github.com/gentoo/gentoo/pull/27108
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Jakov Smolić · gentoo
dev-java/htmlcleaner: Stabilize 2.26 x86, #867805
Signed-off-by: Jakov Smolić <jsmolic@gentoo.org>
- Jakov Smolić · gentoo
dev-java/htmlcleaner: Stabilize 2.26 amd64, #867805
Signed-off-by: Jakov Smolić <jsmolic@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: add 2.26
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net>
Closes: https://github.com/gentoo/gentoo/pull/26697
Signed-off-by: Florian Schmaus <flow@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Agostino Sarubbo · gentoo
dev-java/htmlcleaner: x86 stable wrt bug #816375
Package-Manager: Portage-3.0.20, Repoman-3.0.3
RepoMan-Options: --include-arches="x86"
Signed-off-by: Agostino Sarubbo <ago@gentoo.org>
- Agostino Sarubbo · gentoo
dev-java/htmlcleaner: amd64 stable wrt bug #816375
Package-Manager: Portage-3.0.20, Repoman-3.0.3
RepoMan-Options: --include-arches="amd64"
Signed-off-by: Agostino Sarubbo <ago@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Miroslav Šulc · gentoo
dev-java/htmlcleaner: removed obsolete 2.24
Package-Manager: Portage-3.0.28, Repoman-3.0.3
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Miroslav Šulc · gentoo
dev-java/htmlcleaner: bump to 2.25
Package-Manager: Portage-3.0.26, Repoman-3.0.3
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Miroslav Šulc · gentoo
dev-java/htmcleaner: fixed tests
Package-Manager: Portage-3.0.18, Repoman-3.0.3
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: patched
Package-Manager: Portage-3.0.18, Repoman-3.0.2
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net>
Closes: https://github.com/gentoo/gentoo/pull/20560
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: remove old
Package-Manager: Portage-3.0.18, Repoman-3.0.2
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net>
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: bump to 2.24
Package-Manager: Portage-3.0.18, Repoman-3.0.2
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net>
Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
- Repository mirror & CI · gentoo
Merge updates from master
- Michał Górny · gentoo
*/*: [QA] Fix trivial cases of MissingTestRestrict
The result was achieved via the following pipeline:
    pkgcheck scan -c RestrictTestCheck -R FormatReporter \
      --format '{category}/{package}/{package}-{version}.ebuild' |
    xargs -n32 grep -L RESTRICT |
    xargs -n32 sed -i -e '/^IUSE=.*test/aRESTRICT="!test? ( test )"'
The resulting metadata was compared before and after the change. Few Go ebuilds had to be fixed manually due to implicit RESTRICT=strip added by the eclass. Two ebuilds have to be fixed because of multiline IUSE.
Suggested-by: Robin H. Johnson <robbat2@gentoo.org>
Closes: https://github.com/gentoo/gentoo/pull/13942
Signed-off-by: Michał Górny <mgorny@gentoo.org>
- Robin H. Johnson · gentoo
Drop $Id$ per council decision in bug #611234.
Signed-off-by: Robin H. Johnson <robbat2@gentoo.org>
- Patrice Clement · gentoo
dev-java/htmlcleaner: Stable for amd64. Retroactively mark stable for the remaining arches using the ALLARCHES policy.
Package-Manager: portage-2.2.26
- James Le Cuirot · gentoo
dev-java/htmlcleaner: JAVA_CLASSPATH_EXTRA has been renamed
Package-Manager: portage-2.2.20.1
- James Le Cuirot · gentoo
dev-java/htmlcleaner: Imported and bumped from java-overlay
Closes bug #369977.
Package-Manager: portage-2.2.20.1