CHAPTER 5 — Scanners with One Character Pushback

The scanner in the previous chapter worked by reading characters one by one, processing each one before reading the next. It was always able to process the character it had just read. Often with scanners this is not the case. Sometimes a scanner must read one character beyond the end of a token in order to determine that the token has ended. This chapter shows how scanners do this.

Chapter Topics:

When one character push-back is needed
How to implement push-back by extending BufferedReader
The Java class PushbackReader
An example scanner using push-back

QUESTION 1:

What are the tokens in the following section of a HTML file?

<h1>Important Heading</h1>
<p>
<span style="color:blue">Many words</span>
</p>

Regard each complete HTML tag as a token.