<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<META NAME="Author" CONTENT="Kipp E.B. Hickman">
<META NAME="GENERATOR" CONTENT="Mozilla/4.03 [en] (WinNT; I) [Netscape]">
<BASE HREF="file:///s|/ns/xena/htmlpars/testhtml/">
<BODY TEXT="#000000" BGCOLOR="#FFFFFF" LINK="#FF0000" VLINK="#800080" ALINK="#0000FF">
This document describes the complete handling of HTML in magellan. The
document covers the parsing process: how HTML is lexically analyzed
and then interpreted. After the parsing process is discussed we give a detailed
analysis of each HTML tag and the attributes that are supported, the values
for the attributes and how the tag is treated by magellan.
HTML is tokenized by an HTML scanner. The scanner is fed Unicode data to
parse. Stream converters are used to translate from various encodings to
Unicode. The scanner separates the input stream into tokens which consist
41 conditional comments
</LI>
The HTML parsing engine uses the HTML scanner for lexical analysis. The
parsing engine operates by attacking the input stream in a set of well
48 The parser processes the head portion of the document first, without emitting
49 any output. This is done to discover a few special features of html:
</LI>
53 The parser processes META tags looking for META TARGET
</LI>
56 The parser processes META tags looking for META tags which affect the character
57 set. Nav4 handles the very first character set defining meta tag (all others
58 are ignored) by reloading the document with the proper character conversion
59 module inserted into the stream pipeline.
</LI>
63 After the head portion is processed the parser then proceeds to process
64 the body of the document
</LI>
69 Tags are processed by the parser by locating a
<B>"tag handler"</B> for
70 the tag. The HTML parser serves as the tag handler for all of the builtin
71 tags documented below. Tag attribute handling is done during translation
72 of tags into content. This mapping translates the tag attributes into content
73 data and into style data. The translation to style data is documented below
74 by indicating the mapping from tag attributes to their CSS1 (plus extensions)
The following list describes hacks added to the magellan parsing engine
to deal with navigator compatibility. These are just the parser hacks,
not the layout or presentation hacks. Most hacks are introduced for HTML
syntax error recovery. HTML doesn't say much about how to handle those
error conditions. Netscape has made a big effort to render pages with imperfect
HTML. For many reasons, new browsers need to stay compatible in this area.
Entities can be used as an escape in a quoted string. For the value string in a
name-value pair, see
<A HREF="../testhtml/quote001.html">compatibility test quote001.html</A>.
Test line 70 shows that an entity quote at the beginning
means the value is NOT quoted. Test line 90 shows that if the value
starts with a quote, then an entity quote does NOT terminate the value
Wrapping tags are special tags such as title, textarea, server, script,
style, etc. The comment in ns\lib\libparse\pa_parse.c says:
</LI>
98 <BR> * These tags are special in that, after opening one of
99 them, all other tags are ignored until the matching
100 <BR> * closing tag.
<BR>While searching for the end tag, comments and quoted strings are
observed. See
<A HREF="../testhtml/title01.html">compatibility test title01.html</A>.
6.0 handles comments now; quoted strings still need to be added.
If a &lt;tr&gt; or &lt;td&gt; tag is seen outside any &lt;table&gt; scope, it is
ignored. See
<A HREF="../testhtml/table110.htm">compatibility test table110.htm</A>.
</LI>
<FONT COLOR="#000000">When a table appears inside a table but not inside a cell, the table
tags before the last table tag are ignored. We found this problem in some
Netscape public pages; see bug #85118. For example, with
&lt;table&gt; &lt;table border&gt; ..., or &lt;table&gt; &lt;tr&gt; &lt;table border&gt; ...,
the table is displayed with a border.</FONT> See
<A HREF="../testhtml/table201.html">compatibility test table201.html</A>.
The table and tr tags are buffered for this recovery.
When a TD or CAPTION tag is opened, the buffer is flushed, because we
cannot buffer the contents of TD or CAPTION given performance and memory constraints:
they are subdocs and can be very big. If we see a &lt;table&gt; outside a cell
after the previous table has been flushed, the new &lt;table&gt; tag is ignored.
Nav4.0 can discard the previous table in such a case.
<A HREF="../testhtml/tableall.html">tableall.html</A> is the index for the table test cases.
</LI>
Caption is not a commonly used feature. In Nav4.0, captions can be anywhere.
For captions outside cells, the first one takes effect. For captions inside
cells, the last one takes effect, and they also close TD and TR. In 6.0,
caption is limited to the standard position: after &lt;table&gt;. Captions
in other places are ignored and their contents are treated as text. See test
cases table05a.html to table05o.html.
</LI>
<FONT COLOR="#000000">For &lt;table&gt; &lt;tr&gt; &lt;tr&gt;, the first &lt;tr&gt;
takes effect.</FONT></LI>
The nav4 parser notices when it hits EOF while it is in the middle of scanning
a comment. When this happens, the parser goes back and looks for an
improperly closed comment (e.g. a simple &gt; instead of a --&gt;). If it finds
one, it reparses the input after closing out the comment.
</LI>
142 <FONT COLOR=
"#FF0000">XXX Brendan also pointed out that there is something
143 similar done for tags, but I don't recall what it is right now.
</FONT></LI>
<FONT COLOR="#000000">When Nav4.0 sees the '&lt;' sign, it searches for
'&gt;', observing quoted values. If it cannot find one before EOF, the '&lt;'
sign is treated as text. In Xena 6.0, a limit is set on how far ahead the '&gt;'
is searched for. The default limit is 4096 characters, and there is an API,
HTMLScanner.setMaxTagLength(), to change it. Setting -1 means no limit, which
is the same as Nav4.0.</FONT></LI>
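<P>The bounded search for the end of a tag can be sketched as follows. This is an illustration in Python, not the Xena scanner itself: the function name <TT>find_tag_end</TT> is hypothetical, and counting the limit in characters from the '&lt;' sign is an assumption.

```python
def find_tag_end(text, start, max_tag_length=4096):
    """Return the index of the '>' that closes the tag opened at text[start].

    Returns -1 when no closing '>' is found within the limit, in which case
    the caller would treat the '<' as plain text. Quoted attribute values
    are skipped, so a '>' inside quotes does not end the tag.
    A max_tag_length of -1 means "no limit" (the Nav4.0 behavior).
    """
    if text[start] != "<":
        raise ValueError("start must point at a '<' character")
    if max_tag_length < 0:
        limit = len(text)
    else:
        # Assumption: the limit is counted in characters from the '<'.
        limit = min(len(text), start + max_tag_length)
    quote = None  # the quote character we are currently inside, if any
    for i in range(start + 1, limit):
        ch = text[i]
        if quote is not None:
            if ch == quote:
                quote = None
        elif ch in "\"'":
            quote = ch
        elif ch == ">":
            return i
    return -1
```

With the default limit, a '&lt;' followed by more than 4096 characters without a '&gt;' falls back to text; passing -1 searches all the way to EOF.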
152 <FONT COLOR=
"#FF0000">TODO:
</FONT>
153 <UL><FONT COLOR=
"#FF0000">Document the mapping of tag attributes into CSS1
154 style, including any new
"css1" attributes
</FONT>
156 <B>List of
6.0 features incompatible with
4.0</B>
In Navigator 4.0, a value string is truncated at 82 characters. The XENA60 limit
is MAX_STRING_LENGTH = 2000.
</LI>
166 Tags (Categorically sorted)
</H2>
167 All line breaks are conditional. If the x coordinate is at the current
168 left margin then a soft line break does nothing. Hard line breaks are ignored
169 if the last tag did a hard line break.
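<P>A minimal sketch of the two conditional break rules, assuming a simple layout state with an x coordinate and a "last break was hard" flag. The class and method names are hypothetical, and clearing the flag when text or a soft break is emitted is an assumption.

```python
class LineState:
    """Tracks just enough layout state to decide whether a break is emitted."""

    def __init__(self, left_margin=0):
        self.left_margin = left_margin
        self.x = left_margin
        self.last_was_hard_break = False
        self.breaks = 0  # number of line breaks actually emitted

    def add_text(self, width):
        self.x += width
        self.last_was_hard_break = False

    def soft_break(self):
        # A soft break does nothing when already at the left margin.
        if self.x == self.left_margin:
            return False
        self._emit()
        return True

    def hard_break(self):
        # A hard break is ignored if the previous break was also hard.
        if self.last_was_hard_break:
            return False
        self._emit()
        self.last_was_hard_break = True
        return True

    def _emit(self):
        self.breaks += 1
        self.x = self.left_margin
        self.last_was_hard_break = False
```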
171 <P><B>divalign
</B> = left | right | center | justify
172 <BR><B>alignparam
</B> = abscenter | left | right | texttop | absbottom
173 | baseline | center | bottom | top | middle | absmiddle
174 <BR><B>colorspec
</B> = named-color | #xyz | #xxyyzz | #xxxyyyzzz | #xxxxyyyyzzzz
<BR><B>clip
</B> = [auto | value-or-pct-xy](1..4) (pct of width for even
coordinates; pct of height for odd coordinates)
<BR><B>value-or-pct =
</B>an integer with an optional %; if the percent
is present, any following characters are ignored!
179 <BR><B>coord-list
</B> =
<FONT COLOR=
"#DD0000">XXX
</FONT>
180 <BR><FONT COLOR=
"#000000"><B>whitespace-strip
</B> = remove leading and
181 trailing and any embedded whitespace that is not an actual space (e.g.
186 <UL>The TITLE tag is a container tag whose contents are not HTML. The contents
187 are pure text and are processed by the parser until the closing tag is
188 found. There are no attributes on the tag and any whitespace present in
189 the tag is compressed down with leading and trailing whitespace eliminated.
190 The first TITLE tag found by the parser is used as the document's title
191 (subsequent tags are ignored).
</UL>
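<P>The TITLE rules can be sketched like this. The helper names are hypothetical, and "compressed down" is assumed to mean that each run of whitespace becomes a single space.

```python
import re


def normalize_title(raw):
    """Collapse whitespace runs to one space and trim both ends."""
    return re.sub(r"\s+", " ", raw).strip()


def document_title(title_contents):
    """Return the normalized first TITLE; later TITLE tags are ignored."""
    for raw in title_contents:
        return normalize_title(raw)
    return None
```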
193 <UL>Sets the base element in the head portion of the document. Defines
194 the base URL for
<FONT COLOR=
"#DD0000">all
</FONT>? links in the document.
196 <UL><B>HREF
</B>=url [This is an absolute URL]
197 <BR><B>TARGET
</B>=string [must start with XP_ALPHA|XP_DIGIT|underscore
198 otherwise nav4 ignores it]
</UL>
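<P>A sketch of the TARGET check, assuming XP_ALPHA and XP_DIGIT accept ASCII letters and digits. The function name is hypothetical.

```python
import string

# Assumption: XP_ALPHA / XP_DIGIT correspond to ASCII letters and digits.
_VALID_FIRST = string.ascii_letters + string.digits + "_"


def accept_base_target(target):
    """Return the target if nav4 would honor it, otherwise None."""
    if target and target[0] in _VALID_FIRST:
        return target
    return None
```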
201 <UL>Can define several header fields (content-encoding, author, etc.)
203 <UL><B>REL
</B>=SMALL_BOOKMARK_ICON|LARGE_BOOKMARK_ICON
204 <UL><B>SRC
</B>=string
</UL>
205 <B>HTTP-EQUIV
</B>=
"header: value"
206 <UL><B>CONTENT
</B>=string
</UL>
208 HTTP-EQUIV values (from libnet/mkutils.c NET_ParseMimeHeader):
210 <BR>CONTENT-DISPOSITION
213 <BR>CONTENT-TYPE [ defines character set only ]
221 <BR>PROXY-AUTHENTICATE
229 <BR>WWW-PROTECTION-TEMPLATE
230 <BR>WINDOW-TARGET
</UL>
231 Style sheets and HTML w3c spec adds this:
232 <UL>CONTENT-STYLE-TYPE [ last one wins; overrides header from server if
236 <UL>List related resources. Used by extensions mechanism to find tag handlers.
237 <FONT COLOR=
"#0000FF">/LINK == LINK!
</FONT>
239 <UL><B>REL
</B>=FONTDEF
240 <UL><B>SRC
</B>=url
</UL>
241 <B>REL
</B>=STYLESHEET [ If MEDIA param is defined it must ==nc screen ]
242 <UL><B>LANGUAGE
</B>=LiveScript|Mocha|JavaScript1.1|JavaScript1.2
243 <BR><B>TYPE
</B>=
"text/javascript" |
"text/css"
245 <BR><B>ARCHIVE
</B>=url
246 <BR><B>CODEBASE
</B>=url
248 <BR><B>SRC
</B>=url
</UL>
250 Note: HREF takes precedence over SRC in nav4.
</UL>
<UL>/HEAD clears the "in_head" flag (but leaves the "in_body" flag alone).
<BR>Text in head clears in_head and sets in_body to true, just as if the author
forgot the /HEAD tag.
255 <BR>Attributes: none
</UL>
258 <BR>Attributes: none
</UL>
260 <UL>Allowed anywhere in the document. Note that entities are not parsed
261 in the style tag's content.
263 <UL><B>LANGUAGE
</B>=LiveScript|Mocha|JavaScript1.1|JavaScript1.2
264 <BR><B>TYPE
</B>=
"text/javascript" |
"text/css"
266 <BR><B>ARCHIVE
</B>=url
267 <BR><B>CODEBASE
</B>=url
269 <BR><B>SRC
</B>=url
</UL>
272 <UL>Frameset with rows=
1 and cols=
1 is ignored.
274 <UL><B>FRAMEBORDER
</B>= no |
0 (zero) [default is no_edges=false]
275 <BR><B>BORDER
</B>= int [clamped:
>=
0 && <=
100]
276 <BR><B>BORDERCOLOR
</B>= color
277 <BR><B>ROWS
</B>= pct-list
278 <BR><B>COLS
</B>= pct-list
</UL>
281 <UL>Border width of zero disables edges.
283 <UL><B>FRAMEBORDER
</B>= no |
0 (zero) [default is framesets value]
284 <BR><B>BORDER
</B>= int [clamped;
>=
0 && <=
100]
285 <BR><B>BORDERCOLOR
</B>= color
286 <BR><B>NORESIZE
</B>= true [default is false]
287 <BR><B>SCROLLING
</B>= yes | scroll | on | no | noscroll | off
<BR><B>SRC
</B>= url [clamped: prevent recursion by eliminating any ancestor
290 <BR><B>NAME
</B>= string
291 <BR><B>MARGINWIDTH
</B>= int (clamped:
>=
1)
292 <BR><B>MARGINHEIGHT
</B>= int (clamped:
>=
1)
</UL>
295 <UL>Used when frames are disabled or for backrev browsers. Has no stylistic
300 <HR WIDTH=
"100%">Body objects:
</H1>
302 <UL>The tag is only processed on open tags and it is always processed.
303 See ns\lib\layout\laytags.c, searching for
"case P_BODY". During tag processing
304 the in_head flag is set to false and the in_body flag is set to true. An
305 attribute is ignored if the document already has that attribute set. Attributes
306 can be set by style sheets, or by previous BODY tags. see
<A HREF=
"../testhtml/head02.html">test
309 <UL><B>MARGINWIDTH
</B>=int [clamped:
>=
0 && < (windowWidth/
2
311 <BR><B>MARGINHEIGHT
</B>=int [clamped:
>=
0 && < (windowHeight/
2
313 <BR><B>BACKGROUND
</B>=url
314 <BR><B>BGCOLOR
</B>=colorspec
315 <BR><B>TEXT
</B>=colorspec
316 <BR><B>LINK
</B>=colorspec
317 <BR><B>VLINK
</B>=colorspec
318 <BR><B>ALINK
</B>=colorspec
<BR><B>ONLOAD, ONUNLOAD, ONFOCUS, ONBLUR, ONHELP
</B>=script
320 <BR><B>ID
</B>=string
</UL>
<UL>An open layer/ilayer tag automatically closes out an open form if one is
open. It does something to the soft line break state too.
326 <UL><B>LEFT
</B>=value-or-pct (pct of
<TT>right-left
</TT> margin)
327 <BR><B>PAGEX
</B>=x (if no LEFT)
328 <BR><B>TOP
</B>=value-or-pct
329 <BR><B>PAGEY
</B>=y (if no TOP)
331 <BR><B>WIDTH
</B>=value-or-pct (pct of
<TT>right-left
</TT> margin)
332 <BR><B>HEIGHT
</B>=value-or-pct
333 <BR><B>OVERFLOW
</B>=string
334 <BR><B>NAME
</B>=string
336 <BR><B>ABOVE
</B>=string
337 <BR><B>BELOW
</B>=string
338 <BR><B>ZINDEX
</B>=int [any value]
339 <BR><B>VISIBILITY
</B>=string
340 <BR><B>BGCOLOR
</B>=colorspec
341 <BR><B>BACKGROUND
</B>=url
</UL>
344 <UL>Container for content which is used when layers are disabled or unsupported.
345 The content has no style consequences (though it could if somebody stuck
346 in some CSS1 style rules for it).
</UL>
348 <UL>Closes the paragraph. If the attribute is present then an alignment
349 gets pushed on the alignment stack. All values are supported by nav4.
351 <UL><B>ALIGN
</B>=divalign
</UL>
<UL>There are no attributes. ADDRESS closes out the open paragraph. The
open tag does a conditional soft line break and then pushes a merge of
the current style with italics enabled onto the style stack. The close tag
always pops the style stack and also does a conditional soft line break.
</UL>
358 <B>PLAINTEXT, XMP
</B>
<UL>PLAINTEXT causes the remaining content to no longer be parsed. XMP
causes the content to not parse entities or other tags. The XMP can be
closed by its own tag (on any boundary); PLAINTEXT is not closed (html3.2
allows it to be closed). Both tags change the style to a fixed font of
365 <UL>Closes the paragraph. Does a hard line break on open and close. Open
366 pushes a fixed width font style of a particular font size on the style
367 stack. The close tag pops the top of the style stack.
368 <BR>Attributes: none
</UL>
370 <UL>Closes the paragraph. The open tag does a hard line break. A fixed
371 font style (unless VARIABLE is present) is pushed on the style stack. The
372 close tag pops the top of the style stack. It also does a hard line break.
375 <BR><B>COLS
</B>=int [clamped:
>=
0]
376 <BR><B>TABSTOP
</B>=int [clamped:
>=
0; clamped value is replaced with default
378 <BR><B>VARIABLE
</B></UL>
381 <UL>This tag doesn't nest. Instead it just sets or clears a flag in the
382 state machine. It has no effect on any other state.
</UL>
<UL>Closes the paragraph. Always does a conditional soft line break. The
open tag pushes an alignment on the alignment stack. The close tag pops
the top alignment off.
387 <BR>Attributes: none
</UL>
<UL>Closes the paragraph. Always does a conditional soft line break. COLS
defines the number of columns to lay out in (like MULTICOL). The open tag
pushes an alignment on the alignment stack (if COLS &gt; 1 then it pretends
to be a MULTICOL tag). The close tag pops an alignment from the alignment
395 <UL><B>ALIGN
</B>=divalign
396 <BR><B>COLS
</B>=int [if cols
> 1 then DIV acts like a MULTICOL tag else
397 DIV is just a container]
398 <UL><B>GUTTER
</B>= int (clamped:
>=
1)
399 <BR><B>WIDTH
</B>= value-or-pct [pct of right-left margin; clamped
>=
1/
0
<UL>Closes the paragraph. The open tag does a hard line break and pushes
a style item which enables bold and disables fixed and italic. The close
tag always pops the top item from the style stack. It also does a hard
line break. If the <B>ALIGN</B> attribute is present then the open tag
pushes an alignment on the alignment stack. The close tag looks at
the top of the alignment stack and, if it's a header of any kind (H1 through
H6), pops the alignment. In either case the close tag also does
a conditional soft line break (this happens before the hard line break).
413 <UL><B>ALIGN
</B>=divalign
</UL>
415 A note regarding closing paragraphs: Any time a close paragraph is done
416 (for any tag) if the top of the alignment stack has a tag named
"P" then
417 a conditional soft line break is done and the alignment is popped.
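<P>The close-paragraph rule can be sketched as follows. The representation of the alignment stack as a list of (tag name, alignment) pairs and the function name are assumptions made for illustration.

```python
def close_paragraph(alignment_stack, soft_break):
    """Apply the close-paragraph rule.

    alignment_stack is a list of (tag_name, alignment) pairs with the top
    of the stack at the end. soft_break is a callable that performs the
    conditional soft line break. Returns True if an alignment was popped.
    """
    if alignment_stack and alignment_stack[-1][0] == "P":
        soft_break()
        alignment_stack.pop()
        return True
    return False
```

Only an entry pushed by a P tag is popped here; alignments pushed by other tags (DIV, CENTER, headers) stay on the stack until their own close tags remove them.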
420 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
422 <UL>Close the paragraph.
424 <UL><B>ALIGN=
</B>left|right|center|abscenter
425 <BR><B>BORDER
</B>=int [clamped: if null then -
1, if
< 1 then
1 ]
426 <BR><B>BORDERCOLOR
</B>=string [if not supplied then set to the text color
428 <BR><B>VSPACE
</B>=int [ clamped:
>=
0 ]
429 <BR><B>HSPACE
</B>=int [ clamped:
>=
0 ]
430 <BR><B>BGCOLOR
</B>=color
431 <BR><B>BACKGROUND
</B>=url
432 <BR><B>WIDTH
</B>=value-or-pct [ % of win.width minus margins; clamped:
434 <BR><B>HEIGHT
</B>=value-or-pct [ % of win.height minus margins; clamped:
436 <BR><B>CELLPADDING
</B>=int [clamped:
>=
0; separate pads take precedence
438 <BR><B>TOPPADDING
</B>= int [clamped:
>=
0 ]
439 <BR><B>BOTTOMPADDING
</B>= int [clamped:
>=
0 ]
440 <BR><B>LEFTPADDING
</B>= int [clamped:
>=
0 ]
441 <BR><B>RIGHTPADDING
</B>= int [clamped:
>=
0 ]
442 <BR><B>CELLSPACING
</B>= int [clamped:
>=
0 ]
443 <BR><B>COLS
</B>=int [clamped:
>=
0]
</UL>
444 The code supports more attributes in the Table attribute handler than it
445 does in the code that gets the attributes from the tag! They are border_top,
446 border_left, border_right, border_bottom, border_style (defaults to outset;
447 allows for outset/dotted/none/dashed/solid/double/groove/ridge/inset).
</UL>
449 <UL>Open TR automatically closes an open table row (and an open table cell
450 if one is open). It also automatically closes a CAPTION tag.
452 <UL><B>BGCOLOR
</B>=color
453 <BR><B>BACKGROUND
</B>=url
454 <BR><B>VALIGN
</B>=top|bottom|middle|center(==middle)|baseline; default
456 <BR><B>ALIGN
</B>=left|right|middle|center(==middle); default is left
</UL>
<UL>If no table then the tag is ignored (open or close). If no row is currently
opened or the current row is already done (because of a &lt;/TR&gt; tag) then
a new row is begun. Oddly enough the tag parameters for the row come from
the TH/TD tag in this case. An open of either of these tags will automatically
close the previous cell.
465 <UL><B>COLSPAN
</B>=int [clamped:
>=
1 && <=
1000 ]
466 <BR><B>ROWSPAN
</B>=int [clamped:
>=
1 && <=
10000 ]
467 <BR><B>NOWRAP
</B> [boolean: disables wrapping ]
468 <BR><B>BGCOLOR
</B>=color [default: inherit from the row; if not row then
469 table; if not table then inherit from an outer table cell; this works because
470 the style is flattened so the outer table cell will have a color]
471 <BR><B>BACKGROUND
</B>=url [same rules as bgcolor for inheritance; tile
472 mode is inherited too and not settable by TH/TD attributes (have to use
473 style sheets for that)]
474 <BR><B>VALIGN
</B>=top|bottom|middle|center(==middle)|baseline; default
476 <BR><B>ALIGN
</B>=left|right|middle|center(==middle); default is left
477 <BR><B>WIDTH
</B>=value-or-pct [ clamped:
>=
0 ]
478 <BR><B>HEIGHT
</B>=value-or-pct [ clamped:
>=
0 ]
</UL>
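<P>Most integer attributes in this document are described as "clamped". A minimal sketch for COLSPAN and ROWSPAN follows, using the bounds listed above. The helper names are hypothetical, and falling back to 1 for an unparseable value is an assumption, not documented behavior.

```python
COLSPAN_MAX = 1000
ROWSPAN_MAX = 10000


def clamp(value, low, high=None):
    """Force value into [low, high]; high=None means no upper bound."""
    if value < low:
        return low
    if high is not None and value > high:
        return high
    return value


def parse_span(raw, high):
    """Parse a COLSPAN/ROWSPAN attribute value and clamp it to 1..high."""
    try:
        value = int(raw)
    except (TypeError, ValueError):
        value = 1  # assumption: unparseable values behave like the minimum
    return clamp(value, 1, high)
```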
481 <UL>An open caption tag will automatically close an open table row (and
484 <UL><B>ALIGN
</B>=bottom
</UL>
485 The code sets the vertical alignment to top w/o providing a mechanism for
486 the user to set it (there is no VALIGN attribute).
</UL>
488 <UL>The open tag does a hard line break. The close tag checks to see if
489 the state machine has an open multicol and if it does then it does a conditional
490 soft line break and then continues to break until both margins are cleared
491 of floating elements. It recomputes the margins based on the list indenting
492 level (?). After the synthetic table is output the close tag does a hard
495 <P>This tag will treat the input as source for a table with one row and
496 COLS columns. The data is laid out using the width divided by the number
497 of columns. After the total height is known, the content is partitioned
498 as evenly as possible between the columns in the table.
500 <UL><B>COLS
</B>=int [clamped: values less than
2 cause the tag to be ignored]
501 <BR><B>GUTTER
</B>=int [clamped:
>=
1]
502 <BR><B>WIDTH
</B>=value-or-pct [pct of right-left margin; clamped:
>=
1/
0
508 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
510 <UL>Closes the paragraph. The open tag does a hard line break. A list with
511 the empty-bullet style is pushed on the list stack (unless TYPE=cite/jwz
512 then a styled list is pushed). The close tag pops any list and does a hard
515 <UL><B>TYPE
</B>=cite | jwz
</UL>
517 <B>UL, OL, MENU, DIR
</B>
<UL>For top-level lists (lists not in lists) a hard break is done on the
open tag, otherwise a conditional soft break is done. The tag always does a
close paragraph. The close tag does a conditional soft line break when nested;
when not nested the close tag does a hard line break (even if no list is
open). The open tag pushes the list on the list stack. The close tag pops
any list off the list stack.
525 <UL><B>TYPE
</B>= none | disc | circle | round | square | decimal | lower-roman
526 | upper-roman | lower-alpha | upper-alpha | A | a | I | i [clamped: if
527 none of the above is picked and OL then the bullet type is
"number" otherwise
528 the bullet type is
"basic"]
529 <BR><B>START
</B>=int [clamped:
>=
1]
530 <BR><B>COMPACT
</B></UL>
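<P>The TYPE fallback for list tags can be sketched like this. The function name is hypothetical, and matching the TYPE value exactly (so that "A" and "a" stay distinct) is an assumption.

```python
KNOWN_TYPES = {
    "none", "disc", "circle", "round", "square", "decimal",
    "lower-roman", "upper-roman", "lower-alpha", "upper-alpha",
    "A", "a", "I", "i",
}


def bullet_type(tag, type_attr):
    """Return the bullet type for a UL/OL/MENU/DIR tag."""
    if type_attr in KNOWN_TYPES:
        return type_attr
    # Nothing recognized: OL falls back to "number", everything else to "basic".
    return "number" if tag.upper() == "OL" else "basic"
```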
<UL>Closes the paragraph. For the open tag, if the list is nested then
a conditional soft line break is done, otherwise a hard line break is done.
The open tag pushes a list on the list stack. The close tag pops any list
from the list stack. Closing the list behaves like closing other lists.
538 <UL><B>COMPACT
</B></UL>
541 <UL>Closes the paragraph. The open tag does a conditional soft line break.
542 Close tags are ignored (except for closing the paragraph).
544 <UL><B>TYPE
</B>= A | a | I | i (if the containing list is an
<B>OL
</B>)
545 <BR><B>TYPE
</B>= round | circle | square (if the containing list is not
546 <B>OL
</B> and not
<B>DL
</B>)
547 <BR><B>VALUE
</B>=int [clamped:
>=
1]
</UL>
548 The magellan html parser allows the full set of list item styles from the
549 OL/DL tag instead of just the limited set that nav4 allows.
</UL>
551 <UL>Closes the paragraph. Close tags are ignored (except for closing the
552 paragraph). DD outside a DL just advances the X coordinate of layout by
553 a small constant. DD inside a DL does a conditional soft line break and
555 <BR>Attributes: none.
</UL>
557 <UL>Closes the paragraph (open or close). Close tags are otherwise ignored.
558 Does a conditional soft line break. Moves the X layout coordinate to the
560 <BR>Attributes: none
</UL>
564 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
<UL>Open anchors push a style on the style stack if the anchor has an <B>HREF</B>.
Close anchors pop as many styles off the top of the style stack as are
anchor tags (in other words, anchor tags don't nest). In addition, any styles
on the stack that have the ANCHOR bit set have it cleared, and their
foreground and background colors are adjusted.
572 <UL><B>NAME
</B>=string
574 <UL><B>TARGET
</B>=target
575 <BR><B>SUPPRESS
</B>=true
</UL>
578 <B>STRIKE, S, TT, CODE, SAMPLE, KBD, B, STRONG, I, EM, VAR, CITE, BLINK,
579 BIG, SMALL, U, INLINEINPUT, SPELL
</B>
580 <UL>The open tag pushes onto the style stack. The close tag always pops
581 the top item from the style stack.
582 <BR>Attributes: none
</UL>
<UL>The open tag pushes a font size decrease on the style stack. The close
tag always pops the top of the style stack. The open and close tags impact
the baseline. The only difference between SUP and SUB is how they impact
the baseline. Note that the baseline information is forgotten after a line
break; therefore a close SUP/SUB on the next line will do strange things.
589 <BR>Attributes: none
</UL>
591 <UL>Ignored by the navigator.
592 <BR>Attributes: none
</UL>
594 <UL>The open font tag with no attributes resets the font size to the base
595 font size. The open tag always pushes a style stack entry. The close tag
596 always pops the top item off the style stack.
598 <UL><B>SIZE
</B>=[+ int | - int | int ]
[clamped:
>=
1 && <=
600 <BR><B>POINT-SIZE=
</B>[+ int | - int | int ] [clamped:
>=
1 &&
602 <BR><B>FONT-WEIGHT
</B>=[+ int | - int | int ] [clamped:
>=
100 &&
604 <BR><B>COLOR
</B>=colorspec
605 <BR><B>FACE
</B>=string
</UL>
607 A note regarding the style stack: The pop of the stack checks to see if
608 the top of the stack is an ANCHOR tag. If it is not an anchor then the
609 top item is unconditionally popped. If the top of the style stack is an
610 anchor tag then the code searches for either the bottom of the stack or
611 the first style stack entry not created by an anchor tag. If the entry
612 is followed by another entry then the entry is removed from the stack (an
613 out-of-order pop in other words). In this case the anchor style stack entry
617 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
618 <B>text, entities
</B>
619 <UL>These are basic content objects that get fed directly to the output.
620 In navigator the text is processed by doing line-breaking (entities have
621 been converted to text already by the parser). The line-breaking is controlled
622 by the margin settings and the list depth, the floating elements, the style
623 attributes (font size, etc.), the preformatted flag, the no-break flag
626 <UL>Close tag is ignored.
629 <BR><B>USEMAP
</B>=url
630 <BR><B>ALIGN
</B>=alignparam
631 <BR><B>SRC
</B>=url [ whitespace is stripped ]
632 <BR><B>LOWSRC
</B>=url
633 <BR><B>ALT
</B>=string
634 <BR><B>WIDTH
</B>=value-or-pct (pct of
<TT>right-left
</TT> width)
635 <BR><B>HEIGHT
</B>=value-or-pct (pct of window height)
636 <BR><B>BORDER
</B>=int [clamped:
>=
0]
637 <BR><B>VSPACE
</B>=int [clamped:
>=
0]
638 <BR><B>HSPACE
</B>=int [clamped:
>=
0]
639 <BR><B>SUPPRESS
</B>=true | false (only in blocked image layout???)
</UL>
<UL>Closes the paragraph. If an open tag then does a conditional soft line
break. The rule inherits alignment from the parent container unless there
is no container (then it's centered) or if the tag defines its own alignment.
645 After the object is inserted into the layout stream a soft line break is
648 <UL><B>ALIGN
</B>=divalign (sort of; in laytags.c it's divalign; in layhrule.c
649 it's left or right only)
650 <BR><B>SIZE
</B>=int (
1 to
100 inclusive)
651 <BR><B>WIDTH
</B>=val-or-pct (pct of
<TT>right-left
</TT> width)
652 <BR><B>NOSHADE
</B></UL>
655 <UL>Does an unconditional soft break. If clear is set then it will also
656 soft break until either the left or right or both margins are clear of
657 floating elements. Note that
<FONT COLOR=
"#0000FF"> /BR == BR!
</FONT>
659 <UL><B>CLEAR
</B>=left | right | all | both
</UL>
663 <BR>Attributes: none
</UL>
<UL>Close tag does nothing. Embeds operate inline just like images (they
don't close the paragraph).
668 <UL><B>HIDDEN
</B>=no | false | off
669 <BR><B>ALIGN
</B>=alignparam
671 <BR><B>WIDTH
</B>=val-or-pct (pct of
<TT>right-left
</TT> width)
<BR><B>HEIGHT
</B>=val-or-pct; if val is &lt; 1 (sometimes) the element
gets HIDDEN automatically
674 <BR><B>BORDER
</B>=int (unsupported by navigator)
675 <BR><B>VSPACE
</B>=int [clamped:
>=
0]
676 <BR><B>HSPACE
</B>=int [clamped:
>=
0]
</UL>
<UL>Used when EMBEDs are disabled. It is a container for regular content
that has no stylistic consequences (no line breaking, no style stack effect,
683 <UL>Applet tags don't nest (there is a notion of current_applet). The open
684 tag automatically closes an open applet tag.
686 <UL><B>ALIGN
</B>=alignparam
687 <BR><B>CODE
</B>=string
688 <BR><B>CODEBASE
</B>=string
689 <BR><B>ARCHIVE
</B>=string
691 <BR><B>NAME
</B>=string [clamped: white space is stripped out]
692 <BR><B>WIDTH
</B>=value-or-pct [pct of right-left width; clamped:
>=
1]
693 <BR><B>HEIGHT
</B>=value-or-pct [pct of window height; clamped
>=
1]
694 <BR><B>BORDER
</B>=int [clamped:
>=
0]
695 <BR><B>HSPACE
</B>=int [clamped:
>=
0]
696 <BR><B>VSPACE
</B>=int [clamped:
>=
0]
</UL>
697 If no width is provided:
698 <UL>if a height was provided, use the height. Otherwise, use
90% of the
699 window width if percentage widths are allowed, otherwise use a value of
702 If no height is provided:
703 <UL>if a width was provided, use the width. Otherwise, use
50% of the window
704 height if percentage widths are allowed, otherwise use a value of
400.
</UL>
If the applet is hidden, then the width/height get forced to zero.
</UL>
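<P>The height defaulting described above can be sketched as follows. Only the height rule is shown; the function and parameter names are hypothetical, and rounding 50% down to a whole pixel is an assumption.

```python
def applet_height(height, width, window_height, percent_allowed, hidden=False):
    """Pick the applet height when the HEIGHT attribute may be missing.

    height and width are the already-resolved attribute values, or None
    when the attribute was not provided.
    """
    if hidden:
        return 0  # hidden applets are forced to zero size
    if height is not None:
        return height
    if width is not None:
        return width  # no height given: reuse the width
    if percent_allowed:
        return window_height * 50 // 100  # 50% of the window height
    return 400
```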
707 <UL>The param tag is supported when contained by the APPLET tag or the
708 OBJECT tag. It has no stylistic consequences. The attribute values from
709 the tag are passed to the containing APPLET or OBJECT. Note that
<FONT COLOR=
"#0000FF">/PARAM
712 <UL><B>NAME
</B>=string [clamped: white space is stripped out]
713 <BR><B>VALUE
</B>=string [clamped: white space is stripped out]
</UL>
714 White space being stripped is done as follows: leading and trailing whitespace
715 is removed. Any embedded whitespace is left alone except if it's a non-space
716 whitespace in which case it is removed.
</UL>
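<P>The whitespace stripping rule can be sketched as follows. The function name is hypothetical, and "whitespace" is assumed to mean the ASCII whitespace characters.

```python
import string


def whitespace_strip(value):
    """Trim both ends, then drop embedded whitespace other than plain spaces."""
    trimmed = value.strip(string.whitespace)
    return "".join(
        ch for ch in trimmed if ch == " " or ch not in string.whitespace
    )
```

Note that embedded spaces are kept as they are, so a tab or newline between two words joins the words rather than separating them.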
718 <UL>The open tag pushes an object onto the object stack. The close tag
719 pops from the object stack. I don't understand how the data stuff works.
721 <UL><B>CLASSID
</B>=string (clsid:, java:, javaprogram:, javabean: are the
722 supported prefixes; maybe it's a url if no prefix shown?)
723 <BR><B>TYPE
</B>=string (a mime type)
724 <BR><B>DATA
</B>=string (data: prefix mentions a url)
</UL>
725 There are more attributes that depend on the type of object being embedded
726 in the page. If the object is a java bean (?) then the applet parameters
731 <BR>CLASSID (instead of CODE)
735 <BR>ID (applets use NAME)
743 <UL>The open tag automatically closes an open map (maps don't nest). There
744 is no stylistic consequence of the map nor does it provide any visible
745 presentation in the normal layout case (an editor would do something different).
746 The map can be declared anywhere in the document.
748 <UL><B>NAME
</B>=string [clamped: white space is stripped out]
</UL>
751 <UL>Does nothing if there is no current map or the tag is a close tag.
753 <UL><B>SHAPE
</B>=default | rect | circle | poly | polygon
754 <BR><B>ALT
</B>=string [clamped: newlines are stripped]
755 <BR><B>COORDS
</B>=coord-list
757 <UL><B>TARGET
</B>=target (only if HREF is specified)
</UL>
761 <UL>A container for server-side javascript. Not evaluated by the client
762 (parsed and ignored). Note: The navigator parser doesn't expand entities
763 in a
<B>SERVER
</B>tag.
</UL>
765 <UL>Close tag is ignored. Open tag provides whitespace during layout:
<B>TYPE
</B>=line/vert/vertical
766 causes a conditional soft line break and then adds
<B>SIZE
</B>to the Y
767 layout coordinate.
<B>TYPE
</B>=word causes a conditional soft word break
768 and then adds
<B>SIZE
</B>to the X layout coordinate.
<B>TYPE
</B>=block
769 causes
<FONT COLOR=
"#DD0000">blockish
</FONT>layout stuff to happen.
771 <UL><B>TYPE
</B>=line | vert | vertical | block (default: word)
772 <UL><B>ALIGN
</B>=alignparam (these
3 params are only for
<B>TYPE
</B>=block)
773 <BR><B>WIDTH
</B>=value-or-pct
774 <BR><B>HEIGHT
</B>=value-or-pct
</UL>
<B>SIZE
</B>=int [clamped: &gt;= 0]
</UL>
780 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
782 <UL>Note: The navigator parser doesn't expand entities in a SCRIPT tag.
784 <UL><B>LANGUAGE
</B>=LiveScript | Mocha | JavaScript1.1 | JavaScript1.2
785 <BR><B>TYPE
</B>=
"text/javascript" |
"text/css"
787 <BR><B>ARCHIVE
</B>=url
788 <BR><B>CODEBASE
</B>=url
790 <BR><B>SRC
</B>=url
</UL>
793 <UL>Used when scripting is off or by backrev browsers. It is a container
794 that has no stylistic consequences.
</UL>
798 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
801 <UL><B>ACTION
</B>=href
802 <BR><B>ENCODING
</B>=string
803 <BR><B>TARGET
</B>=string
804 <BR><B>METHOD
</B>=get | post
</UL>
807 <UL>This tag is a shortcut for creating a form element with a submit button
808 and a single text field. If the PROMPT attribute is not present in the
809 tag then the value used is
<B>"This is a searchable index. Enter search
813 <UL><B>PROMPT
</B>=string
814 <BR><B>ACTION
</B>=href
815 <BR><B>ENCODING
</B>=string
816 <BR><B>TARGET
</B>=string
817 <BR><B>METHOD
</B>=get | post
</UL>
820 <UL>Attributes vary according to type:
821 <UL><B>TYPE
</B>= text | radio | checkbox | hidden | submit | reset | password
822 | button | image | file | jot | readonly | object
823 <BR><B>NAME
</B>= string
826 <UL>attributes are from the IMG tag (!)
</UL>
827 <B>TYPE
</B>= text | password | file
828 <UL>font style is forced to fixed
829 <BR><B>VALUE
</B>= string
830 <BR><B>SIZE
</B>= int (clamped;
>=
1)
831 <BR><B>MAXLENGTH
</B>= int (not clamped!)
</UL>
832 <B>TYPE
</B>= submit | reset | button | hidden | readonly
833 <UL><B>VALUE
</B>=string; default if no value to the attribute varies according
835 <UL><B>submit
</B> -
> "Submit Query"
836 <BR><B>reset
</B> -
> "Reset"
837 <BR>others -
> " " (
2 spaces)
838 <BR>Note also that the value has newlines stripped from it
</UL>
839 <B>WIDTH
</B>=int (clamped
>=
0 && <=
1000) (only for submit,
841 <BR><B>HEIGHT
</B>=int (clamped
>=
0 && <=
1000) (only for submit,
842 reset or button)
</UL>
843 <B>TYPE
</B>=radio | checkbox
844 <UL><B>CHECKED
</B> (flag - if present then set to true)
845 <BR><B>VALUE
</B>= string (the default value is
"on")
</UL>
849 <UL><B>MULTIPLE
</B> (boolean)
850 <BR><B>SIZE
</B>= int (clamped
>=
1)
851 <BR><B>NAME=
</B> string
<BR><B>WIDTH
</B>= int (clamped &gt;= 0 &amp;&amp; &lt;= 1000)
853 <BR><B>HEIGHT
</B>= int (clamped
>=
0 && <=
1000; only examined
854 for single entry lists (!multiple || size==
1))
</UL>
857 <UL>Lives inside the SELECT tag (ignored otherwise).
859 <UL><B>VALUE
</B>=string
860 <BR><B>SELECTED
</B> boolean
</UL>
862 <B>TEXTAREA
</B>
864 <UL><B>NAME
</B>=string
865 <BR><B>ROWS
</B>=int (clamped;
>=
1)
866 <BR><B>COLS
</B>=int (clamped;
>=
1)
867 <BR><B>WRAP
</B>= off | hard | soft (default is off; any value which is
868 not known turns into soft)
</UL>
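<P>A sketch of the WRAP mapping, assuming the comparison is case-insensitive. The function name is hypothetical.

```python
def wrap_mode(raw):
    """Map a TEXTAREA WRAP attribute value to off, hard or soft."""
    if raw is None:
        return "off"  # attribute absent: the default is off
    value = raw.strip().lower()  # assumption: case-insensitive match
    if value in ("off", "hard", "soft"):
        return value
    return "soft"  # any unknown value turns into soft
```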
872 <UL><B>NAME
</B>=string
873 <BR><B>CHALLENGE
</B>=string
874 <BR><B>PQG
</B>=string
875 <BR><B>KEYTYPE
</B>=string
</UL>
880 <HR ALIGN=LEFT
WIDTH=
"50%"></H3>
881 <B>BASEFONT
</B>
882 <UL>Sets the base font value which +/- size values in FONT tags are relative
885 <UL>SIZE=+ int | - int | int (just like FONT)
</UL>
890 <HR WIDTH=
"100%">Unsupported
</H2>
891 <B>NSCP_CLOSE, NSCP_OPEN, NSCP_REBLOCK, MQUOTE, CELL, SUBDOC, CERTIFICATE,
892 INLINEINPUTTHICK, INLINEINPUTDOTTED, COLORMAP, HYPE, SPELL, NSDT
</B>
893 <UL>These tags are unsupported because they are used internally by netscape
894 and are never seen in real content. If somebody does use them between
4.0
895 and magellan, tough beans. We never documented them so they lose.
</UL>