dev-java / htmlcleaner

HTML parser written in Java that can be used as a tool, library or Ant task

Official package site: https://htmlcleaner.sourceforge.net/

HtmlCleaner is an open source HTML parser written in Java. HTML found on the Web is usually dirty, ill-formed and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring some order to the tags, attributes and ordinary text. For any given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows rules similar to those that most web browsers use to create the Document Object Model. However, you can provide custom tag and rule sets for tag filtering and balancing.

v2.29-r1 :: 0 :: gentoo

License
BSD
Keywords
amd64
USE flags
doc source test

General

doc
Add extra documentation (API, Javadoc, etc.). It is recommended to enable this flag per package instead of globally.
source
Zip the sources and install them
test
Enable dependencies and/or preparations necessary to run tests (usually controlled by FEATURES=test but can be toggled independently)
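
As a minimal sketch of the per-package approach recommended for the doc flag, the flags can be set in a file under /etc/portage/package.use/ rather than in the global USE variable. This assumes package.use is a directory on the system in question. The file name htmlcleaner is an arbitrary choice, and the selection of doc and source is only an illustration:

```
# /etc/portage/package.use/htmlcleaner  (file name is an assumption; any name works)
# Enable the doc and source USE flags for this package only.
dev-java/htmlcleaner doc source
```

The test flag is left out of this sketch because, as its description notes, it is usually controlled by FEATURES=test.
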

app-arch / zip : Info ZIP (encryption support)

dev-java / ant : Java-based build tool similar to 'make' that uses XML configuration files

dev-java / java-config : Java environment configuration query tool

dev-java / jdom : Java-based solution for accessing, manipulating, and outputting XML data

dev-java / junit : Simple framework to write repeatable tests

virtual / jdk : Virtual for Java Development Kit (JDK)

app-arch / zip : Info ZIP (encryption support)

dev-java / java-config : Java environment configuration query tool

dev-java / jdom : Java-based solution for accessing, manipulating, and outputting XML data

virtual / jre : Virtual for Java Runtime Environment (JRE)

Repository mirror & CI · gentoo
Merge updates from master
Arthur Zamarin · gentoo
dev-java/htmlcleaner: dekeyword x86
After a message to gentoo-dev ML, and no reply from any user on x86, we decided to dekeyword most Java packages from x86, and reverse dependencies of them. Signed-off-by: Arthur Zamarin <arthurzam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Lucio Sauer · gentoo
*/*: inline mirror://sourceforge
bump copyright of touched ebuilds to 2024 Signed-off-by: Lucio Sauer <watermanpaint@posteo.net> Signed-off-by: Michał Górny <mgorny@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: switch to dev-java/ant
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/35723/commits/3d2a3e632890240c092f67ad33077c6159b4a9ae Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Repository mirror & CI · gentoo
Merge updates from master
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: drop 2.26
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/34473 Signed-off-by: Arthur Zamarin <arthurzam@gentoo.org>
Arthur Zamarin · gentoo
dev-java/htmlcleaner: Stabilize 2.29 x86, #920688
Signed-off-by: Arthur Zamarin <arthurzam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Sam James · gentoo
dev-java/htmlcleaner: Stabilize 2.29 amd64, #920688
Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: add 2.29 - CVE-2023-34624
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/33883 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: drop 2.25
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/27108 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Jakov Smolić · gentoo
dev-java/htmlcleaner: Stabilize 2.26 x86, #867805
Signed-off-by: Jakov Smolić <jsmolic@gentoo.org>
Jakov Smolić · gentoo
dev-java/htmlcleaner: Stabilize 2.26 amd64, #867805
Signed-off-by: Jakov Smolić <jsmolic@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: add 2.26
Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/26697 Signed-off-by: Florian Schmaus <flow@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Agostino Sarubbo · gentoo
dev-java/htmlcleaner: x86 stable wrt bug #816375
Package-Manager: Portage-3.0.20, Repoman-3.0.3 RepoMan-Options: --include-arches="x86" Signed-off-by: Agostino Sarubbo <ago@gentoo.org>
Agostino Sarubbo · gentoo
dev-java/htmlcleaner: amd64 stable wrt bug #816375
Package-Manager: Portage-3.0.20, Repoman-3.0.3 RepoMan-Options: --include-arches="amd64" Signed-off-by: Agostino Sarubbo <ago@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Miroslav Šulc · gentoo
dev-java/htmlcleaner: removed obsolete 2.24
Package-Manager: Portage-3.0.28, Repoman-3.0.3 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Miroslav Šulc · gentoo
dev-java/htmlcleaner: bump to 2.25
Package-Manager: Portage-3.0.26, Repoman-3.0.3 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Miroslav Šulc · gentoo
dev-java/htmcleaner: fixed tests
Package-Manager: Portage-3.0.18, Repoman-3.0.3 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: patched
Package-Manager: Portage-3.0.18, Repoman-3.0.2 Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Closes: https://github.com/gentoo/gentoo/pull/20560 Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: remove old
Package-Manager: Portage-3.0.18, Repoman-3.0.2 Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Volkmar W. Pogatzki · gentoo
dev-java/htmlcleaner: bump to 2.24
Package-Manager: Portage-3.0.18, Repoman-3.0.2 Signed-off-by: Volkmar W. Pogatzki <gentoo@pogatzki.net> Signed-off-by: Miroslav Šulc <fordfrog@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Michał Górny · gentoo
*/*: [QA] Fix trivial cases of MissingTestRestrict
The result was achieved via the following pipeline:
    pkgcheck scan -c RestrictTestCheck -R FormatReporter \
      --format '{category}/{package}/{package}-{version}.ebuild' |
    xargs -n32 grep -L RESTRICT |
    xargs -n32 sed -i -e '/^IUSE=.*test/aRESTRICT="!test? ( test )"'
The resulting metadata was compared before and after the change. Few Go ebuilds had to be fixed manually due to implicit RESTRICT=strip added by the eclass. Two ebuilds have to be fixed because of multiline IUSE.
Suggested-by: Robin H. Johnson <robbat2@gentoo.org> Closes: https://github.com/gentoo/gentoo/pull/13942 Signed-off-by: Michał Górny <mgorny@gentoo.org>
Robin H. Johnson · gentoo
Drop $Id$ per council decision in bug #611234.
Signed-off-by: Robin H. Johnson <robbat2@gentoo.org>
Patrice Clement · gentoo
dev-java/htmlcleaner: Stable for amd64. Retroactively mark stable for the remaining arches using the ALLARCHES policy.
Package-Manager: portage-2.2.26
James Le Cuirot · gentoo
dev-java/htmlcleaner: JAVA_CLASSPATH_EXTRA has been renamed
Package-Manager: portage-2.2.20.1
James Le Cuirot · gentoo
dev-java/htmlcleaner: Imported and bumped from java-overlay
Closes bug #369977. Package-Manager: portage-2.2.20.1