Remove SGML markup.
stripsgml removes SGML markup tags from a file. stripsgml also attempts to translate standard special character entity references to ASCII (or the character set specified by the -charset option).
stripsgml cannot handle document type declarations. It only can handle document instances.
stripsgml is invoked from the Unix shell as follows:
% stripsgml [options] < sgmlfile > asciifile
The following are the list of options available:
-charset
charsetUse charset as the character set to use while processing. By default, ASCII is assumed, so entity references for special characters and character references are mapped to ASCII text. set_charset allows the entity references and character references to interpreted under a different character set. Only the ISO-8859 character sets (1-10) are supported.
-help
Print out a terse description of all options available.
-html
Cause URLs in anchor elements in an HTML document to be preserved in the output.
-ignore
nameSet parameter entity with the name, name, to "IGNORE". This option may be useful if the input data contains marked sections where parameter entity references are used to control the status keyword.
This option can be specified multiple times to set multiple parameter entities.
-include
nameSet parameter entity with the name, name, to "INCLUDE". This option may be useful if the input data contains marked sections where parameter entity references are used to control the status keyword.
This option can be specified multiple times to set multiple parameter entities.
This software is part of the perlSGML package; see (http://www.oac.uci.edu/indiv/ehood/perlSGML.html)