Java HTML-to-XML / HTML Parser Library

Download Chilkat Java Library

Download Chilkat Java x64 Library

Java Library Reference Docs · Purchase · License · Java Examples

HTML to XML Conversion Java Library.

The Chilkat HTML-to-XML Java library transforms HTML into well-formed XML for parsing. In effect, it is an HTML parser / scraper. Once HTML is converted to XHTML (i.e. well-formed XML), the many existing XML parsing components and libraries can be used for HTML parsing and scraping.

  • File-to-file HTML to XML conversion.
  • Memory-to-memory HTML to XML conversion.
  • Convert the character encoding during the conversion process.
  • Flexibility in controlling how HTML entities are handled.
  • Automatically convert HTML entities to corresponding 8-bit characters.
  • Optionally drop all text formatting tags from the output.
  • Drop/undrop specific tags from the output.
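
To illustrate why well-formed XML is convenient for scraping, the sketch below parses an already-converted document with the XML parser and XPath engine that ship with the JDK, and collects every link target. It does not call the Chilkat library: the class name XhtmlLinkScraper, the method extractLinks, and the SAMPLE_XML string are hypothetical and stand in for whatever the conversion step produces. It also assumes the converted elements carry no XML namespace, so a plain //a/@href expression matches.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class XhtmlLinkScraper {

    // Hypothetical sample: stands in for the well-formed XML that an
    // HTML-to-XML conversion step would hand to the rest of the program.
    public static final String SAMPLE_XML =
        "<html><body><p>See <a href=\"a.html\">A</a> and "
        + "<a href=\"b.html\">B</a>.</p></body></html>";

    // Parse the XML with the JDK's DOM parser and return every href
    // attribute found on an <a> element, in document order.
    public static List<String> extractLinks(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));

        // Assumes un-namespaced element names (see the note above).
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList hrefs = (NodeList) xpath.evaluate(
            "//a/@href", doc, XPathConstants.NODESET);

        List<String> links = new ArrayList<>();
        for (int i = 0; i < hrefs.getLength(); i++) {
            links.add(hrefs.item(i).getNodeValue());
        }
        return links;
    }

    public static void main(String[] args) throws Exception {
        for (String link : extractLinks(SAMPLE_XML)) {
            System.out.println(link);
        }
    }
}
```

The same parsing step would fail on typical real-world HTML (unclosed tags, unquoted attributes), which is the gap the conversion to well-formed XML is meant to close.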

HTML / XML Examples


Privacy Statement. Copyright 2000-2010 Chilkat Software, Inc. All rights reserved.
Send feedback to support@chilkatsoft.com

Components for Microsoft Windows 7, Vista, XP, 2000, 2003 Server, and Windows 95/98/NT4.