Computer Science Engineering (CSE) Exam  >  Computer Science Engineering (CSE) Notes  >  Compiler Design  >  Input Buffer & Lexical Analyzer Generator

Input Buffer & Lexical Analyzer Generator | Compiler Design - Computer Science Engineering (CSE) PDF Download

Lexical Analyzer Generator:

Creating a lexical analyzer with Lex:

  • First, a specification of a lexical analyzer is prepared by creating a program lex.l in the Lex language. Then, lex.l is run through the Lex compiler to produce a C program lex.yy.c.
  • Finally, lex.yy.c is run through the C compiler to produce an object program a.out, which is the lexical analyzer that transforms an input stream into a sequence of tokens

                                                       Input Buffer & Lexical Analyzer Generator | Compiler Design - Computer Science Engineering (CSE)
 

Lex Specification

A Lex program consists of three parts:

{ definitions } %%
{ rules } %%
{ user subroutines }
o Definitions include declarations of variables, constants, and regular definitions
o Rules are statements of the form p1 {action1}p2 {action2} … pn {action}
o where pi is regular expression and actioni describes what action the lexical analyzer should take when pattern pi matches a lexeme. Actions are written in C code.
o User subroutines are auxiliary procedures needed by the actions. These can be compiled separately and loaded with the lexical analyzer.

Question for Input Buffer & Lexical Analyzer Generator
Try yourself:
What is the purpose of a lexical analyzer generator?
View Solution


INPUT BUFFERING The LA scans the characters of the source program one at a time to discover tokens. Because of large amount of time can be consumed scanning characters, specialized buffering techniques have been developed to reduce the amount of overhead required to process an input character.
 

Buffering techniques:
1. Buffer pairs
2. Sentinels
 

The lexical analyzer scans the characters of the source program one a t a time to discover tokens. Often, however, many characters beyond the next token many have to be examined before the next token itself can be determined. For this and other reasons, it is desirable for the lexical analyzer to read its input from an input buffer. Figure shows a buffer divided into two halves of, say 100 characters each. One pointer marks the beginning of the token being discovered. A look ahead pointer scans ahead of the beginning point, until the token is discovered .we view the position of each pointer as being between the character last read and the character next to be read. In practice each buffering scheme adopts one convention either a pointer is at the symbol last read or the symbol it is ready to read.

Token beginnings look ahead pointer, The distance which the look ahead pointer may have to travel past the actual token may be large.
For example, in a PL/I program we may see: DECALRE (ARG1, ARG2… ARG n) without knowing whether DECLARE is a keyword or an array name until we see the character that follows the

The document Input Buffer & Lexical Analyzer Generator | Compiler Design - Computer Science Engineering (CSE) is a part of the Computer Science Engineering (CSE) Course Compiler Design.
All you need of Computer Science Engineering (CSE) at this link: Computer Science Engineering (CSE)
Are you preparing for Computer Science Engineering (CSE) Exam? Then you should check out the best video lectures, notes, free mock test series, crash course and much more provided by EduRev. You also get your detailed analysis and report cards along with 24x7 doubt solving for you to excel in Computer Science Engineering (CSE) exam. So join EduRev now and revolutionise the way you learn!
Sign up for Free Download App for Free
26 videos|67 docs|30 tests

Up next

FAQs on Input Buffer & Lexical Analyzer Generator - Compiler Design - Computer Science Engineering (CSE)

1. What is an input buffer in the context of computer science engineering?
Ans. In computer science engineering, an input buffer is a temporary storage area that holds data coming from an input device before it is processed. It allows the system to receive and store input data at a faster rate than it can be processed, preventing data loss or delays.
2. How does an input buffer help in the lexical analysis process?
Ans. An input buffer plays a crucial role in the lexical analysis process. It acts as a temporary storage for the input text, allowing the lexical analyzer to read and process the input one character or token at a time. By buffering the input data, the lexical analyzer can efficiently retrieve and analyze the required information, aiding in the overall efficiency and performance of the lexical analysis phase.
3. What is a lexical analyzer generator in computer science engineering?
Ans. A lexical analyzer generator is a software tool that automates the creation of lexical analyzers, also known as lexers or scanners. It takes a set of specifications, typically defined using regular expressions or other formal languages, and generates code for a lexical analyzer that can recognize and tokenize input text based on those specifications. This tool helps developers save time and effort in implementing the lexical analysis phase of a compiler or interpreter.
4. How does a lexical analyzer generator simplify the development process?
Ans. A lexical analyzer generator simplifies the development process by abstracting away the low-level details of building a lexical analyzer. It allows developers to define the desired lexical rules using a higher-level specification language, such as regular expressions, instead of manually writing complex code. The generator then automatically generates efficient and optimized code for the lexical analyzer, reducing the development time and potential errors.
5. What are some popular lexical analyzer generator tools used in computer science engineering?
Ans. There are several popular lexical analyzer generator tools used in computer science engineering, including: - Flex: Flex is a fast lexical analyzer generator that is commonly used in conjunction with the Bison parser generator. It supports the generation of C or C++ code for lexical analyzers. - ANTLR: ANTLR (ANother Tool for Language Recognition) is a powerful and widely-used lexical and syntactic analyzer generator. It supports multiple target languages, including Java, C#, and Python. - JFlex: JFlex is a lexical analyzer generator specifically designed for Java. It generates Java code for efficient lexical analysis. - Lex: Lex is a classic lexical analyzer generator that was developed along with the original Unix operating system. It generates C code for lexical analyzers and is still widely used today. - Coco/R: Coco/R is a compiler generation tool that supports the generation of lexical analyzers as well as parsers. It supports various target languages, including C#, Java, and C++.
26 videos|67 docs|30 tests
Download as PDF

Up next

Explore Courses for Computer Science Engineering (CSE) exam
Related Searches

Summary

,

Viva Questions

,

Input Buffer & Lexical Analyzer Generator | Compiler Design - Computer Science Engineering (CSE)

,

Exam

,

pdf

,

Sample Paper

,

study material

,

Input Buffer & Lexical Analyzer Generator | Compiler Design - Computer Science Engineering (CSE)

,

mock tests for examination

,

Extra Questions

,

Objective type Questions

,

Previous Year Questions with Solutions

,

Input Buffer & Lexical Analyzer Generator | Compiler Design - Computer Science Engineering (CSE)

,

ppt

,

practice quizzes

,

shortcuts and tricks

,

past year papers

,

Free

,

video lectures

,

Semester Notes

,

MCQs

,

Important questions

;