Natural Language Processing CSCI 4152/6509 — Lecture 6 Regular Expressions; Text Processing in Perl
Instructor: Vlado Keselj
Time and date: 09:35–10:25, 16-Jan-2020
Location: Dunn 135
CSCI 4152/6509, Vlado Keselj Lecture 6 1 / 12
◮ Read the textbook [JM], Chapter 2
◮ Perl “Camel book” or many resources on the Internet
◮ On the bluenose server: ‘man perlre’ and ‘man perlretut’
◮ The same effect: ‘perldoc perlre’ and ‘perldoc
◮ Or on the web:
◮ Prototyping: a good prototyping language; expressive, so it can express a lot in a few lines of code
◮ Incremental: useful even if you learn only a small part of it, and more useful as you learn more; i.e., its learning curve is not steep
◮ Flexible: e.g., most tasks can be done in more than one way
◮ Managed memory: garbage collection and memory management
◮ Open-source: free, open-source; portable, extensible
◮ RegEx support: powerful string and data manipulation, regular expressions
◮ Efficient: relatively, especially considering that it is an interpreted language
◮ OOP: supports object-oriented style
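As a rough illustration of the expressiveness and regular-expression support listed above, here is a minimal Perl sketch that counts word frequencies in a string. The helper name `word_counts` and the sample sentence are made up for this example and are not taken from the lecture; treating a "word" as a run of `\w` characters is also a simplifying assumption.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not from the lecture): count how often each
# word occurs in a string, ignoring case.
sub word_counts {
    my ($text) = @_;
    my %count;
    # With the /g flag in scalar context, each pass of the loop finds the
    # next match; \w+ is a run of letters, digits, or underscores.
    $count{ lc($1) }++ while $text =~ /(\w+)/g;
    return \%count;
}

my $counts = word_counts("The cat saw the other cat.");

# Print words by decreasing frequency; ties are broken alphabetically.
for my $w (sort { $counts->{$b} <=> $counts->{$a} || $a cmp $b } keys %$counts) {
    print "$w\t$counts->{$w}\n";
}
```

For the sample sentence, `cat` and `the` each get a count of 2, while `other` and `saw` get 1. The whole tokenize-and-count step fits in one statement, which is the sense in which a short Perl script can be handy for quick text-processing prototypes.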