Learn how to manipulate and format data with AWK, a Mac and Linux command-line tool.
So you've got a large data set. New information is always coming in, and it's messy and inconsistent. Text files, Excel spreadsheets, web forms, email messages, and so on. You'd like to find a way to format it into something clean and consistent you can use in a database, and you'd like to automate that process as much as possible. Enter AWK, a command-line tool for Mac, Unix, and Linux that allows you to manipulate data in powerful ways.
In this course, award-winning author and teacher David D. Levine shows you how to use AWK to read and write data in a variety of formats, produce reports, and automate repetitive tasks. He reviews the nuts and bolts of the language, such as field separators, pattern matching, variables, operators, expressions, and control structures; functions available for manipulating data; and integration with other programs like Excel.
What you should know before watching this course
Using the exercise files
1. What Is AWK?
What is AWK?
2. AWK Command Line Basics
Writing an AWK program
Working with records, fields, patterns, and actions
Using AWK command-line flags
3. Understanding Records and Fields
Exploring basic input-field separators
Specifying field and record separators with variables
Challenge: Change a CSV file to a tab-separated one
Solution: Change a CSV file to a tab-separated one
4. Understanding Variables and Operators
Using built-in variables
Creating user-defined variables
Working with operators and arrays
5. A Quick Introduction to Regular Expressions
Regular expression basics
Working with character classes and quantifiers
6. Using Control Structures
Building control structures
Creating an HTML table
Challenge: Print only those lines consisting of a complete HTML entity
Solution: Print only those lines consisting of a complete HTML entity
7. Formatting the Output
Formatting output with printf()
Formatting output with width and precision specifiers