Using Object-Oriented Syntax with the IO::Handle and IO::File Modules
We use Perl 5's object-oriented facilities extensively later in this book. Although you won't need to know much about creating object-oriented modules, you will need a basic understanding of how to use OO modules and their syntax. This section illustrates the basics of Perl's OO syntax with reference to the IO::Handle and IO::File modules, which together form the basis of Perl's object-oriented I/O facilities.
Objects and References
In Perl, references are pointers to data structures. You can create a reference to an existing data structure using the backslash operator. For example:
$a = 'hi there';
$a_ref = \$a;    # reference to a scalar
@b = ('this','is','an','array');
$b_ref = \@b;    # reference to an array
%c = ( first_name => 'Fred', last_name => 'Jones');
$c_ref = \%c;    # reference to a hash
Once a reference has been created, you can make copies of it, as you would any regular scalar, or stuff it into arrays or hashes. When you want to get to the data contained inside a reference, you dereference it using the prefix appropriate for its contents:
$a = $$a_ref;
@b = @$b_ref;
%c = %$c_ref;
You can index into array and hash references without dereferencing the whole thing by using the -> syntax:
$b_ref->[2];           # yields "an"
$c_ref->{last_name};   # yields "Jones"
It is also possible to create references to anonymous, unnamed arrays and hashes, using the following syntax:
$anonymous_array = ['this','is','an','anonymous','array'];
$anonymous_hash = { first_name => 'Jane', last_name => 'Doe' };
If you try to print out a reference, you'll get a string like HASH(0x82ab0e0), which indicates the type of reference and the memory location where it can be found (which is just short of useless).
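The reference operations described so far can be pulled together in a short script. This is only an illustrative sketch: the variable names are made up for the example, and the built-in ref() function, which the text has not mentioned, is used to report what kind of data a reference points to.

```perl
use strict;
use warnings;

# Named data and references to it (variable names are illustrative).
my @words    = ('this', 'is', 'an', 'array');
my $list_ref = \@words;
my $person   = { first_name => 'Jane', last_name => 'Doe' };   # anonymous hash

# ref() reports the kind of thing a reference points to.
my $list_type   = ref $list_ref;    # 'ARRAY'
my $person_type = ref $person;      # 'HASH'

# A reference points at the original data, not a copy: changes made
# through it are visible in the named array.
$list_ref->[2] = 'one';
my $third = $words[2];

# Dereferencing the whole array in scalar context gives its length.
my $count = scalar @$list_ref;

print "$list_type $person_type $third $count $person->{last_name}\n";
```

Because $list_ref and @words refer to the same storage, the assignment through the reference changes the third element of the named array as well.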
An object is a reference with just a little bit extra. It is "blessed" into a particular module's package in such a way that it carries information about what module created it. The blessed reference will continue to work just like other references. For example, if the object named $object is a blessed hash reference, you can index into it like this:
$object->{last_name};
What makes objects different from plain references is that they have methods. A method call uses the -> notation, but followed by a subroutine name and optional subroutine-style arguments:
$object->print_record(); # invoke the print_record() method
You may sometimes see a method called with arguments, like this:
$object->print_record(encoding => 'EBCDIC');
The "=>" symbol is accepted by Perl as a synonym for ','. It makes the relationship between the two arguments more distinct, and has the added virtue of automatically quoting the argument on the left. This allows us to write encoding rather than "encoding". If a method takes no arguments, it's often written with the parentheses omitted, as in:
$object->print_record;
In many cases, print_record() will be a subroutine defined in the object's package. Assuming that the object was created by a module named BigDatabase, the above is just a fancy way of saying this:
BigDatabase::print_record($object);
However, Perl is more subtle than this, and the print_record() method definition might actually reside in another module, which the current module inherits from. How this works is beyond the scope of this introduction; it is covered in the perltoot, perlobj, and perlref POD pages, as well as in [Wall 2000] and the other general Perl reference works listed in Appendix D.
To create an object, you must invoke one of its constructors. A constructor is a method call that is invoked from the module's name. For example, to create a new BigDatabase object:
$object = BigDatabase->new(); # call the new() constructor
Constructors, which are a special case of a class method, are frequently named new(). However, any subroutine name is possible. Again, this syntax is part trickery. In most cases an equivalent call would be:
$object = BigDatabase::new('BigDatabase');
This is not quite the same thing, however, because class methods can also be inherited.
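To make the method-call and constructor syntax concrete, here is a minimal sketch of what a class like BigDatabase might look like. The package body is entirely hypothetical: the text names only the class, so the fields and the record_as_string() method are assumptions made for illustration.

```perl
use strict;
use warnings;

# Hypothetical package, named after the BigDatabase example in the text.
package BigDatabase;

# Constructor: a class method that receives the class name as its first argument.
sub new {
    my ($class, %fields) = @_;
    my $self = { %fields };          # an ordinary anonymous hash...
    return bless $self, $class;      # ...blessed into the class
}

# Instance method: receives the object itself as its first argument.
sub record_as_string {
    my ($self, %opts) = @_;
    my $encoding = $opts{encoding} || 'ASCII';
    return "$self->{first_name} $self->{last_name} ($encoding)";
}

package main;

my $object = BigDatabase->new(first_name => 'Fred', last_name => 'Jones');

# The arrow syntax and the fully qualified function call do the same work here.
my $via_method   = $object->record_as_string(encoding => 'EBCDIC');
my $via_function = BigDatabase::record_as_string($object, encoding => 'EBCDIC');

print ref($object), "\n";    # the package the object was blessed into
print "$via_method\n";
```

The two calls return the same string only because record_as_string() is defined directly in BigDatabase. If it were inherited, only the arrow form would find it.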
The IO::Handle and IO::File Modules
The IO::Handle and IO::File modules, standard components of Perl, together provide an object-oriented interface to filehandles. IO::Handle provides generic methods that are shared by all filehandle objects, including pipes and sockets. The more specialized class, IO::File, adds functionality for opening and manipulating files. Together, these classes smooth out some of the bumps and irregularities in Perl's built-in filehandles, and make larger programs easier to understand and maintain.
IO::File's elegance does not by itself provide any very compelling reason to choose the object-oriented syntax over native filehandles. Its main significance is that IO::Socket, IO::Pipe, and other I/O-related modules also inherit their behavior from IO::Handle. This means that programs that read and write from local files and those that read and write to remote network servers share a common, easy-to-use interface.
We'll get a feel for the module by looking at a tiny example of a program that opens a file, counts the number of lines, and reports its findings (Figure 1.4).
Lines 1-3: Load modules. We turn on strict syntax checking, and load the IO::File module.
Lines 4-5: Initialize variables. We recover from the command line the name of the file to perform the line count on, and initialize the $counter variable to zero.
Line 6: Create a new IO::File object. We call the IO::File::new() method, using the syntax IO::File->new(). The argument is the name of the file to open. If successful, new() returns a new IO::File object that we can use for I/O. Otherwise it returns undef, and we die with an error message.
Lines 7-9: Main loop. We call the IO::File object's getline() method in the test portion of a while() loop. This method returns the next line of text, or undef on end of file, just like <>.
Each time through the loop we bump up $counter. The loop continues until getline() returns undef.
Line 10: Print results. We print out our results by calling STDOUT->print(). We'll discuss why this surprising syntax works in a moment.
When I ran count_lines.pl on the unfinished manuscript for this chapter, I got the following result:
% count_lines.pl ch1.pod
Counted 2428 lines
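The steps in the walkthrough can be sketched as follows. This is a reconstruction from the description, not the listing in Figure 1.4, and its line numbers do not match the figure. The counting logic is wrapped in a hypothetical count_lines() subroutine and run against a small sample file created with File::Temp, so the sketch runs without a command-line argument.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use IO::File;
use File::Temp qw(tempfile);

# Hypothetical helper: count the lines in $file using IO::File methods.
sub count_lines {
    my $file    = shift;
    my $counter = 0;
    # "<" opens for reading; the walkthrough passes only the file name.
    my $fh = IO::File->new($file, "<") or die "Can't open $file: $!\n";
    while (defined(my $line = $fh->getline)) {
        $counter++;
    }
    $fh->close;
    return $counter;
}

# Build a three-line sample file so the sketch runs on its own.
my ($sample_fh, $sample_name) = tempfile(UNLINK => 1);
print $sample_fh "one\ntwo\nthree\n";
close $sample_fh or die "Can't close sample file: $!";

my $counter = count_lines($sample_name);
STDOUT->print("Counted $counter lines\n");
```

To get the command-line behavior described in the walkthrough, replace $sample_name with a file name taken from @ARGV.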
IO::File objects are actually blessed typeglob references (see the Passing and Storing Filehandles section earlier in this chapter). This means that you can use them in an object-oriented fashion, as in:
$fh->print("Function calls are for the birds.\n");
or with the familiar built-in function calls:
print $fh "Object methods are too effete.\n";
Many of IO::File's methods are simple wrappers around Perl's built-in functions. In addition to print() and getline() methods, there are read(), syswrite(), and close() methods, among others. We discuss the pros and cons of using object-oriented method calls and function calls in Chapter 5, where we introduce IO::Socket.
When you load IO::File (technically, when IO::File loads IO::Handle, which it inherits from), it adds methods to ordinary filehandles. This means that any of the methods in IO::File can also be used with STDIN, STDOUT, STDERR, or even with any conventional filehandles that you happen to create. This is why line 10 of Figure 1.4 allows us to print to standard output by calling STDOUT->print().
Of the method listings that follow, only the new() and new_tmpfile() methods are actually defined by IO::File. The rest are inherited from IO::Handle and can be used with other descendants of IO::Handle, such as IO::Socket. This list is not complete. I've omitted some of the more obscure methods, including those that allow you to move around inside a file in a record-oriented fashion, because we won't need them for network communications.
$fh = IO::File->new($filename [,$mode [,$perms]])
The new() method is the main constructor for IO::File. It is a unified replacement for both open() and sysopen(). Called with a single argument, new() acts like the two-argument form of open(), taking a filename optionally preceded by a mode string. For example, this will open the indicated file for writing, truncating it if it already exists:
$fh = IO::File->new(">darkstar.txt");
If called with two or three arguments, IO::File treats the second argument as the open mode, and the third argument as the file creation permissions. $mode may be a Perl-style mode string, such as "+<", or a numeric mode, such as those used by sysopen(). As a convenience, IO::File automatically imports the Fcntl O_* constants when it loads. In addition, IO::File accepts the alternative symbolic mode strings used by the C fopen() call; for example, "w" opens the file for writing. We won't discuss those modes further here, because they do not add functionality.
The permission argument given by $perms is an octal number, and has the same significance as the corresponding parameter passed to sysopen().
If new() cannot open the indicated file, it will return undef and set $! to the appropriate system error message.
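The following sketch exercises the different calling styles of new() side by side. It works in a scratch directory created with File::Temp (an assumption made to keep the example self-contained). The file name darkstar.txt is borrowed from the text, and the other names are illustrative.

```perl
use strict;
use warnings;
use IO::File;
use File::Temp qw(tempdir);

# Work in a scratch directory so the example leaves nothing behind.
my $dir  = tempdir(CLEANUP => 1);
my $path = "$dir/darkstar.txt";

# Perl-style mode string as the second argument: create or truncate for writing.
my $out = IO::File->new($path, ">") or die "Can't open $path: $!";
$out->print("first line\n");
$out->close;

# Numeric mode built from the O_* constants that IO::File imports from Fcntl,
# plus creation permissions, as with sysopen().
my $append = IO::File->new($path, O_WRONLY|O_APPEND|O_CREAT, 0644)
    or die "Can't append to $path: $!";
$append->print("second line\n");
$append->close;

# fopen()-style mode string: "r" opens for reading.
my $in = IO::File->new($path, "r") or die "Can't read $path: $!";
my @lines = $in->getlines;
$in->close;

# A failed open returns undef and leaves the reason in $!.
my $missing = IO::File->new("$dir/no-such-file.txt", "<");
my $reason  = defined $missing ? '' : "$!";

print scalar(@lines), " lines; missing file: $reason\n";
```

Passing the mode as a separate argument also means that a file name beginning with ">" or "<" is not mistaken for a mode prefix, as it would be in the single-argument form.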
$fh = IO::File->new_tmpfile
The new_tmpfile() constructor, which is called without arguments, creates a temporary file opened for reading and writing. On UNIX systems, this file is anonymous, meaning that it cannot be seen on the file system. When the IO::File object is destroyed, the file and all its contents will be deleted automatically.
This constructor is useful for storing large amounts of temporary data.
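As a minimal sketch of this use, the script below writes a few lines to a temporary file, rewinds, and reads them back. The seek() method is inherited from IO::Seekable, which this section does not otherwise cover. The example also mixes the method syntax and the built-in print() on the same handle.

```perl
use strict;
use warnings;
use IO::File;

my $tmp = IO::File->new_tmpfile or die "Can't create temporary file: $!";

# The same handle accepts both method calls and the built-in functions.
$tmp->print("line $_\n") for 1 .. 3;
print $tmp "line 4\n";

# Rewind to the start of the file and read the data back.
$tmp->seek(0, 0) or die "Can't rewind: $!";
my @lines = $tmp->getlines;

# Closing (or destroying) the object discards the file.
$tmp->close;

print "Read back ", scalar(@lines), " lines\n";
```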
$result = $fh->close
The close() method closes the IO::File object, returning a true result if successful. If you do not call close() explicitly, it will be called automatically when the object is destroyed. This happens when the script exits, if you happen to undef() the object, or if the object goes out of scope, such as when a my variable reaches the end of its enclosing block.
$result = $fh->open($filename [,$mode [,$perms]])
You can reopen a filehandle object on the indicated file by using its open() method. The input arguments are identical to new(). The method result indicates whether the open was successful.
This is chiefly used for reopening the standard filehandles STDOUT, STDIN, and STDERR. For example:
STDOUT->open(">log.txt") or die "Can't reopen STDOUT: $!";
Calls to print() will now write to the file log.txt.
$result = $fh->print(@args)
$result = $fh->printf($fmt,@args)
$result = $fh->write($data [,$length [,$offset]])
$bytes = $fh->syswrite($data [,$length [,$offset]])
The print(), printf(), and syswrite() methods work exactly like their built-in counterparts. For example, print() takes a list of data items, writes them to the filehandle object, and returns true if successful.
The write() method is the opposite of read(), writing a stream of bytes to the filehandle object. Unlike the C write() call, it returns true if successful, as print() does, rather than the number of bytes written. It is similar to syswrite(), except that it uses stdio buffering. This method corrects the inconsistent naming of the built-in write() function, which creates formatted reports. The IO::File object method that corresponds to the built-in write() goes by the name of format_write().
$line = $fh->getline
@lines = $fh->getlines
$bytes = $fh->read($buffer,$length [,$offset])
$bytes = $fh->sysread($buffer,$length [,$offset])
The getline() and getlines() methods together replace the <> operator. getline() reads one line from the filehandle object and returns it, behaving in the same way in both scalar and list contexts. The getlines() method acts like <> in a list context, returning a list of all the available lines. getline() will return undef at the end of file.
The read() and sysread() methods act just like their built-in function counterparts.
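The difference between these input methods is easiest to see on a small file. The sketch below uses a temporary file as a stand-in for real input, and its contents are invented for the example.

```perl
use strict;
use warnings;
use IO::File;

my $fh = IO::File->new_tmpfile or die "Can't create temporary file: $!";
$fh->print("alpha\nbeta\ngamma\n");
$fh->seek(0, 0);

# getline() returns a single line, even when called in list context.
my ($first) = $fh->getline;

# getlines() returns everything that remains, one element per line.
my @rest = $fh->getlines;

# At end of file, getline() returns undef.
my $at_eof = $fh->getline;

# read() fills a buffer with up to the requested number of bytes
# and returns how many it actually read.
$fh->seek(0, 0);
my $buffer = '';
my $bytes  = $fh->read($buffer, 5);

$fh->close;

print "first=$first", "remaining lines: ", scalar(@rest), "\n";
print "read $bytes bytes: $buffer\n";
```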
$previous = $fh->autoflush([$boolean])
The autoflush() method gets or sets the autoflush mode for the filehandle object. Called without arguments, it turns on autoflush. Called with a single boolean argument, it sets autoflush to the indicated status. In either case, autoflush() returns the previous value of the autoflush state.
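Because autoflush() returns the previous state, it can be used to change the setting temporarily and then put it back. A small sketch using STDOUT follows; the variable names are illustrative.

```perl
use strict;
use warnings;
use IO::Handle;

# No argument: turn autoflush on, remembering what was in effect before.
my $before = STDOUT->autoflush;

# Turn it off again; the return value is the state set by the previous call.
my $during = STDOUT->autoflush(0);

# Restore whatever was in effect at the start.
STDOUT->autoflush($before);

print "previous states: ", ($before ? 1 : 0), " then ", ($during ? 1 : 0), "\n";
```

The second call returns a true value, since the first call had just turned autoflush on.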
$boolean = $fh->opened
The opened() method returns true if the filehandle object is currently valid. It is equivalent to:
defined fileno($fh);
$boolean = $fh->eof
Returns true if the next read on the filehandle object will return EOF.
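A brief sketch of how opened() and eof() change over the life of a handle, using a one-line temporary file invented for the example:

```perl
use strict;
use warnings;
use IO::File;

my $fh = IO::File->new_tmpfile or die "Can't create temporary file: $!";
$fh->print("only line\n");
$fh->seek(0, 0);

my $open_before = $fh->opened;   # true: the handle is valid
my $eof_before  = $fh->eof;      # false: a line is waiting to be read

my $line = $fh->getline;
my $eof_after = $fh->eof;        # true: nothing left to read

$fh->close;
my $open_after = $fh->opened;    # false: the handle has been closed

printf "opened: %d -> %d, eof: %d -> %d\n",
    map { $_ ? 1 : 0 } $open_before, $open_after, $eof_before, $eof_after;
```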
$fh->flush
The flush() method immediately flushes any data that is buffered in the filehandle object. If the filehandle is being used for writing, then its buffered data is written to disk (or to the pipe, or network, as we'll see when we get to IO::Socket objects). If the filehandle is being used for reading, any data in the buffer is discarded, forcing the next read to come from disk.
$boolean = $fh->blocking([$boolean])
The blocking() method turns on and off blocking mode for the filehandle. We discuss how to use this at length in Chapter 13.
$fh->clearerr
$boolean = $fh->error
These two methods are handy if you wish to perform a series of I/O operations and check the error status only after you're finished. The error() method will return true if any errors have occurred on the filehandle since it was created, or since the last call to clearerr(). The clearerr() method clears this flag.
In addition to the methods listed here, IO::File has a constructor named new_from_fd(), and a method named fdopen(), both inherited from IO::Handle. These methods can be used to save and restore objects in much the way that the >&FILEHANDLE syntax does with standard filehandles.
$fh = IO::File->new_from_fd($fd,$mode)
The new_from_fd() method opens up a copy of the filehandle object indicated by $fd using the read/write mode given by $mode. The object may be an IO::Handle object, an IO::File object, a regular filehandle, or a numeric file descriptor returned by fileno(). $mode must match the mode with which $fd was originally opened. For example:
$saveout = IO::File->new_from_fd(STDOUT,">");
$result = $fh->fdopen($fd,$mode)
The fdopen() method is used to reopen an existing filehandle object, making it a copy of another one. The $fd argument may be an IO::Handle object, a regular filehandle, or a numeric file descriptor. $mode must match the mode with which $fd was originally opened.
This is typically used in conjunction with new_from_fd() to restore a saved filehandle:
$saveout = IO::File->new_from_fd(STDOUT,">");   # save STDOUT
STDOUT->open('>log.txt');                       # reopen on a file
STDOUT->print("Yippy yie yay!\n");              # print something
STDOUT->fdopen($saveout,">");                   # reopen on saved value
See the POD documentation for IO::Handle and IO::File for information about the more obscure features that these modules provide.