HTML

See below to parse HTML input or format HTML output.

Note

The HTML is not validated.

Setting Stream Properties

You can set the following properties to parse the format of HTML input or to format HTML output. Select the necessary component and expand the Format property in the Stream pane.

Property name Value
Output Encoding Select the stream's encoding.
utf-8 Unicode UTF-8
shift_jis Shift JIS
euc-jp EUC-JP
iso-2022-jp ISO-2022-JP
utf-16 Unicode UTF-16
Windows-31J Windows-31J

Connecting HTML Input and Output Streams

HTML streams don't have field definitions. They contain a single field, the stream itself. In the mapper, you connect an HTML stream as a single field.