Subject: Re: WWW query engine bug (was Query-PR)
To: der Mouse <mouse@Collatz.McRCIM.McGill.EDU>
From: None <Chris_G_Demetriou@NIAGARA.NECTAR.CS.CMU.EDU>
List: current-users
Date: 02/21/1996 17:06:29
> > (3) if the user does a 'long-range' <html>, perhaps one which
> > is never closed, how does the scanner deal with that? some
> > of the PRs are gigantic, and i think it's unreasonable to
> > have to have it parse them completely before it processes
> > any of them.
>
> I don't see why there's any need to. Your scanner just has to keep a
> bit saying whether it's inside an unclosed <HTML>...</HTML>, and if
> it's not, just do mindless mapping of < to <, etc.
... How do you check if the <html> is closed, without parsing the
entire file? (i.e. do you say that the <html></html> may only be 1,
2, N lines long? etc.)
by doing this, it becomes much harder to implement the program as a
filter...
cgd