Home > Articles > Security > Software Security

Writing Insecure C, Part 2

David Chisnall
  • PrintPrint
  • Share ThisShare This
  • DiscussDiscuss
Close WindowDavid Chisnall

David Chisnall

Learn more…

The State of Open Source 3D
Feb 9, 2010
What Is Mac OS X?
Feb 5, 2010
Snow Leopard: The Underhyped APIs
Jan 29, 2010
Foundation: The Objective-C Standard Library
Jan 26, 2010
Cocoa Tips: Exposing System Services
Jan 22, 2010
Cocoa Tips: Don't Reimplement Standard Functionality
Jan 15, 2010
Localizing Cocoa
Jan 8, 2010
How Core Animation Changed Cocoa Drawing
Jan 4, 2010
Using Distributed Objects in Cocoa
Jan 1, 2010
Inside Modern X11 Programming
Sep 18, 2009
Making JavaScript Fast, Part 2
Sep 15, 2009
Security in Your Pocket: OpenBSD on ARM
Sep 11, 2009
Making JavaScript Fast, Part 1
Sep 8, 2009
The Failure of the GPL
Aug 31, 2009
How Not To Optimize
Aug 21, 2009
A Half-Way Step to Apple’s Source Code: An Interview with David Chisnall
Jun 5, 2009
Advanced Flow Control for Objective-C
Jun 5, 2009
Erica Sadun on the iPhone SDK, OS X, and the Computing Landscape
Jun 5, 2009
From NeXTSTEP to Cocoa: Erik Buck on the Development of Cocoa and Objective-C
Jun 5, 2009
Fun with the Objective-C Runtime
Jun 5, 2009
Marcus Zarra and Matt Long on Core Animation
Jun 5, 2009
Steve Kochan on the Evolution of Objective-C
Jun 5, 2009
The Technology NeXT Gave the World
Jun 5, 2009
Where the Web and the Desktop Meet: An Interview with Lee Barney
Jun 5, 2009
Pandora: An Open Console
Jun 2, 2009
The Future of Wireless Networking
May 15, 2009
GNU or Linux?
May 11, 2009
Debugging C-Family Languages
Mar 27, 2009
How Small Is Your PC? The Rise of Netbooks and Other Small Form-Factor PCs
Mar 23, 2009
David Chisnall's CPU Feature Wishlist
Mar 13, 2009
The Dynamic Languages Renaissance
Jan 30, 2009
Robert Seacord on the CERT C Secure Coding Standard
Dec 15, 2008
Objective-C for C++ Programmers, Part 3
Nov 21, 2008
Objective-C for C++ Programmers, Part 2
Nov 14, 2008
Objective-C for C++ Programmers, Part 1
Nov 7, 2008
Writing Insecure C, Part 3
Oct 24, 2008
Writing Insecure C, Part 2
Oct 17, 2008
Writing Insecure C, Part 1
Oct 10, 2008
iRex iLiad e-Reader: Linux's Answer to the Kindle?
Aug 29, 2008
How It Works: Filesystems
Jun 13, 2008
How the LLVM Compiler Infrastructure Works
May 23, 2008
How It Works: Virtual Memory
May 21, 2008
What Is C For?
May 16, 2008
The Future of eBooks
Apr 25, 2008
Imagining an Open Network
Apr 18, 2008
Understanding How Xen Approaches Device Drivers
Mar 21, 2008
Examining the Legendary HURD Kernel
Mar 14, 2008
Competition Among Open Source Compilers
Feb 1, 2008
Inside Your OS: What is a Process Scheduler, and How Does it Work?
Jan 25, 2008
Bad UI of the Week: Read This (OK/Cancel)
Jan 18, 2008
The End of the Desktop Era
Jan 11, 2008
The What and Why of Open IM
Dec 28, 2007
A Look at the Modern X Server
Dec 21, 2007
The Future of Digital Media
Dec 14, 2007
The Future of Identity
Dec 7, 2007
Bad UI of the Week: Ask Forgiveness, Not Permission
Nov 21, 2007
Copyright Versus Free Software
Nov 16, 2007
Is Computer Science Dying?
Nov 9, 2007
A Brief History of Programming, Part 2
Nov 2, 2007
A Brief History of Programming, Part 1
Oct 26, 2007
The 700MHz Question: Will the Wireless Spectrum Auction Lead to Innovation or More of the Same?
Sep 28, 2007
Bad UI of the Week: The Menu Bar
Aug 24, 2007
The Dark Corners of x86
Aug 17, 2007
Bad UI of the Week: The Cross-Platform User Interface
Aug 17, 2007
Bad UI of the Week: The Mythical "is Like" Operator
Aug 10, 2007
Bad UI of the Week: Don't Make Me Tell You Twice...
Aug 3, 2007
Bad UI of the Week: Kettles and Washing Machines
Jul 27, 2007
The BBC iPlayer Controversy Explained
Jul 20, 2007
Bad UI of the Week: The Mitten Mouse
Jul 20, 2007
Bad User Interface of the Week: File It Under “Bad”
Jul 13, 2007
Bad User Interface of the Week: The DVD
Jul 6, 2007
A Roundup of Free Operating Systems
Jun 22, 2007
DragonFly BSD: UNIX for Clusters?
Jun 15, 2007
CPU Wars, Part 3: Put Your Left ARM In
May 18, 2007
CPU Wars, Part 2: POWER to the People
May 11, 2007
CPU Wars, Part 1: When the Chips Are Down
May 4, 2007
ZFS Uncovered
Apr 6, 2007
Vector Programming with GCC
Mar 30, 2007
Free Software Versus Open Source Software
Mar 16, 2007
What Programming Languages Should You Know?
Mar 9, 2007
Standardizing UNIX
Feb 2, 2007
Prolog: Logic Programming for Rapid Development
Jan 26, 2007
POSIX Parallel Programming, Part 3: Threads
Jan 19, 2007
POSIX Parallel Programming, Part 2: Message Passing
Jan 12, 2007
POSIX Parallel Programming, Part 1
Jan 5, 2007
The Nokia 770 Revisited
Dec 29, 2006
The Open Source Desktop Myth
Dec 22, 2006
Separating Style and Content: LaTeX and Typesetting
Dec 1, 2006
GNUstep: A Free Software alternative to OpenStep
Nov 10, 2006
Behind the Scenes of Objective-C 2.0
Nov 3, 2006
The Future of CPUs: What's After Multi-Core?
Oct 27, 2006
What Makes a Good Programming Language?
Oct 20, 2006
Emulation: Role-Playing for Computers
Oct 13, 2006
NetBSD: Not Just for Toasters
Oct 6, 2006
POSIX Asynchronous I/O
Sep 22, 2006
Breaking Down GPL Version 3
Aug 18, 2006
The Role of Binary Drivers in a Free OS
Aug 4, 2006
Security Is a UI Problem
Jul 28, 2006
Debunking the Myth of High-level Languages
Jul 14, 2006
A Taste of Erlang, a Dynamic, Asynchronous Message-Passing Language
Jun 30, 2006
Alternatives to LAMP
Jun 2, 2006
BSD Packaging Systems
May 26, 2006
DRM: Digital Rights or Digital Restrictions?
May 4, 2006
Introducing OpenBSD 3.9
Apr 28, 2006
The Need for Virtualization and Xen
Mar 31, 2006
Making Effective Software TCO Calculations
Mar 24, 2006
10 Things I Hate About U(NIX) Revisited: Readers Speak
Mar 17, 2006
Comparing Open Source Licenses: GPL vs. BSDL
Feb 3, 2006
BSD: The Other Free UNIX Family
Jan 20, 2006
Measuring the Effectiveness of Application Security Policies
Jan 13, 2006
The Cost of Free Software
Dec 9, 2005
Nokia 770 Internet Tablet Week-long Test Drive
Nov 18, 2005
10 Things I Hate About (U)NIX
Nov 4, 2005
The Lure of Open Source Software: Why Consider It for Your Business?
Oct 14, 2005
Cocoa Tip of the Day, 1/29/10
By on January 29, 2010 No Comments

Don't ignore old versions of OS X.

Cocoa Tip of the Day, 1/28/10
By on January 28, 2010 No Comments

Exceptions should be exceptional.

Cocoa Tip of the Day, 1/27/10
By on January 27, 2010 No Comments

Explore the runtime system.

Cocoa Tip of the Day, 1/26/10
By on January 26, 2010 No Comments

Copy design patterns from Cocoa.

Cocoa Tip of the Day, 1/25/10
By on January 25, 2010 No Comments

Profile with Instruments.

Cocoa Tip of the Day, 1/22/10
By on January 22, 2010 No Comments

Expose system services.

Cocoa Tip of the Day, 1/21/10
By on January 21, 2010 No Comments

Always read the release notes for new OS X versions.

Cocoa Tip of the Day, 1/20/10
By on January 20, 2010 No Comments

Broadcast events with notifications.

Cocoa Tip of the Day, 1/19/10
By on January 19, 2010 No Comments

Port your code with GNUstep.

Cocoa Tip of the Day, 1/18/10
By on January 18, 2010 No Comments

Use CoreAnimation for caching.

Cocoa Tip of the Day, 1/15/10
By on January 15, 2010 No Comments

Don't recreate standard features.

Cocoa Tip of the Day, 1/14/10
By on January 14, 2010 No Comments

Don't forget NSCell.

Cocoa Tip of the Day, 1/13/10
By on January 13, 20102 Comments

Plan for code reuse.

Cocoa Tip of the Day, 1/12/10
By on January 12, 2010 No Comments

Remember the C in Objective-C.

Cocoa Tip of the Day, 1/11/10
By on January 11, 2010 No Comments

Separate interfaces and implementations.

Cocoa Tip of the Day, 1/8/10
By on January 8, 2010 No Comments

Think about localisation early.

Cocoa Tip of the Day, 1/7/10
By on January 7, 2010 No Comments

Read the Human Interface Guidelines.

Cocoa Tip of the Day, 1/6/10
By on January 6, 2010 No Comments

Don't optimise yet.

Cocoa Tip of the Day, 1/5/10
By on January 5, 2010 No Comments

Put controllers in nib files.

Cocoa Tip of the Day, 1/4/10
By on January 4, 2010 No Comments

Don't write code.

Cocoa Tip of the Day, 1/1/10
By on January 1, 2010 No Comments

Use Distributed Objects for local network communication.

CERT C Secure Coding Standard, The

Like this article? We recommend
CERT C Secure Coding Standard, The

Continuing his series on insecure C, David Chisnall points out some problems arising from handling of integers and memory in C.

In part 1 of this series, we looked at some of the problems that C causes with novice programmers, related to error checking and variable initialization. This week we'll dig a bit deeper and see how C handles integer values—an unexpected cause of security holes—and look at how we can build some less error-prone memory management routines on top of C.

Integer Issues

If you're used to high-level languages, C's support for numerical values may seem distressingly primitive. The same might be true even if you come from assembly language, since C also doesn't expose any of the condition codes found in a modern CPU to the programmer.

In a high-level language, numbers typically are arbitrary-precision, where the precision is defined by your need. C has quite a selection of integer types. The most common of these is int, which is expected to correspond to a machine word. On the machines where I learned C, this was 16 bits. Now it's typically 32 bits, even on architectures where the machine word is 64 bits, since a lot of code has been written now assuming that it's always 32.

One of the most common causes of strange behavior is trying to store a pointer value in an int. On common 32-bit platforms, this strategy works fine. On some 64-bit platforms it works, but on others it doesn't. The C99 standard defines a new integer type, intptr_t, which is guaranteed to be big enough to store a pointer. If you're planning to store a pointer in an integer, you should always use intptr_t.

I say pointer, and this is one of the other slightly confusing corners. C defines two kind of pointers, one that points to code and one that points to data. A pointer to a function isn't guaranteed to be the same size as one that points to data. On a lot of embedded platforms, it's common to use 16-bit pointers for code and 32-bit pointers for data. Casting a void* to a function pointer results in some data being discarded in this case.

The other core kinds of integer in C are char, short, and long. C99 also introduced a long long. None of these have fixed sizes, but they all have minimum sizes. (Technically, they have minimum ranges of values they can store, and make no guarantees as to their internal layout.) A short must be at least 16 bits, a long at least 32, and a long long at least 64. If you need a minimum amount of precision, pick one of these, and you may get a bit more space than you requested, depending on the architecture.

I haven't mentioned chars yet because they differ subtly from the other types. All of the other basic integer types are signed unless explicitly declared as unsigned. This isn't always the case with char. Sadly, it isn't never the case with chars, either—whether a char is signed or not depends entirely on the implementation. If you're using chars, always declare them as signed or unsigned explicitly; if you don't, you might be surprised later.

C has some quite surprising rules for implicit conversions between these types in operations. It's a common assumption that the precision of an operation will be specified by how you use it. For example, suppose you do this:

a = b + c;

Because you're storing the result in a, you might assume that the calculation would be performed with whatever the type of a is. In fact, it will be performed with the type of b or c. This makes sense when you consider that the alternative is to have the value of (b + c) dependent on something other than b and c themselves. You might assume that the type of this expression will be the type of b. The C99 standard defines a number of rules for determining the type, but in general it will be whichever is the larger type, b or c—for example, if a is a char and b is an int, the type of the expression will be an int. A fairly common bug comes from doing something like this:

long long a;
long b;// = something
long c;// = something

a = b * c;

a is at least 64 bits, b and c at least 32 bits. Since you can only guarantee that b and c are 32 bits, you shouldn't have put anything bigger than that in them, even if your compiler or platform has 64-bit longs. When you multiply them, you get a result that fits in a 64-bit integer, and you store it in something that fits a 64-bit integer. Sounds sensible? It is, except for the fact that the result is truncated to 32 bits just before the assignment. Here's the correct way of doing this:

a = (long long)b * c;

This sign-extends b to 64 bits (or more, depending on the implementation). The type promotion rules then kick in, ensuring that c has a type with as much space as b, so it's also sign-extended. The multiplication then occurs as a multiplication of a pair of long longs, with 32 or more leading zeros, and the result (a long long) is stored in a long long variable.

In general, you can avoid surprises by explicitly converting the types to something with enough precision for the operation first. Make sure that the signedness matches on both. This is a very common cause of errors, since you can accidentally lose the top bit when converting a big unsigned value to its signed equivalent.

A more subtle bug comes from the integer overflow. This is especially common with malloc, where a common pattern is to write malloc(i * sizeof(x)). If an attacker has any influence over i, he can try to make this overflow. For very large values of i, this will give a result much smaller than i, which is a problem. The call to malloc will succeed, but when you attempt to use the resulting array, only the first few values will be valid. An attacker could cause you to overwrite other data.

A simple way of avoiding this kind of hole is to use calloc() instead of malloc(). (Of course, you hope that your system's implementation of calloc will perform bounds-checking itself and not simply malloc() and memset() count*size bytes.)

realloc() is more of a problem. No standard way exists of doing this, so you need to do it yourself. Fortunately, OpenSSH includes xrealloc(), which is a bounds-checking version of realloc(). This includes a number of other checks, but if you don't need all of them you can implement a simplified version:

void * xrealloc(void *ptr, size_t nmemb, size_t size)
{
    void *new_ptr;
    size_t new_size = nmemb * size;
    if (SIZE_T_MAX / nmemb < size)
            return NULL;
        return realloc(ptr, new_size);
}

This test is quite simple. SIZE_T_MAX is the largest value that a size_t can have. When you divide by the number of elements requested, you get the maximum size that each element can be without an overflow. If this size is smaller than the size requested, you have an overflow, and so you return NULL.

realloc returns NULL in case of an error, so you should always be checking that the return value from realloc is valid. Unfortunately, this is a fairly common cause of memory leaks (which, in turn, can become denial-of-service attacks). If realloc() returns NULL, the original pointer is still valid. It's common for developers to forget this principle and just do something like the following:

ptr = realloc(ptr, newsize);

When realloc() return NULL, you've lost your reference to the old memory location. FreeBSD provides a convenience function, reallocf(), which is the equivalent of this:

void *reallocf(void* ptr, size_t size)
{
    void *newptr = realloc(ptr, size);
    if (NULL == newptr)
    {
        free(ptr);
    }
    return newptr;
}

If you don't have any alternative code paths for recovering when realloc fails, you would be advised to do something of this sort.

  • Share ThisShare This
  • Your Account

Discussions

Make a New Comment

You must log in in order to post a comment.

Related Resources

Jennifer  BortelWin FREE iPhone Developer Books and Videos- Introducing @InformIT Giveaways
By Jennifer Bortel on February 5, 2010 No Comments

Apples’s recent iPad announcement made our hearts flutter so we couldn’t resist making an announcement of our own!

Today marks the first ever @InformIT Giveaway!

We’ll regularly post a video like this one profiling spectacular prizes we’re giving away—from books and videos to T-shirts and other exciting stuff. Check out the video below to see the giveaways for today, and then scroll down for more prize details and instructions on how to win them!

Dustin Sullivan"Every OSX developer should have this book on their desk."
By Dustin Sullivan on February 1, 2010 No Comments

That was the sentence Mike Riley ended his recent Dr Dobb's CodeTalk review of Cocoa Programming Developer's Handbook with.

David ChisnallCocoa Tip of the Day, 1/29/10
By David Chisnall on January 29, 2010 No Comments

Don't ignore old versions of OS X.

See All Related Blogs

Informit Network