Home > Articles > Security > Network Security

Information Flow

By Matt Bishop
Jan 3, 2005

📄 Contents

␡

⎙ Print

< Back Page 2 of 6 Next >

This chapter is from the book 

Introduction to Computer Security

Learn More Buy

15.2 Compiler-Based Mechanisms

Compiler-based mechanisms check that information flows throughout a program are authorized. The mechanisms determine if the information flows in a program could violate a given information flow policy. This determination is not precise, in that secure paths of information flow may be marked as violating the policy; but it is secure, in that no unauthorized path along which information may flow will be undetected.

Definition 15–3. A set of statements is certified with respect to an information flow policy if the information flow within that set of statements does not violate the policy.

EXAMPLE: Consider the program statement

if x = 1 then y := a;
else y := b;

By the rules discussed earlier, information flows from x and a to y or from x and b to y, so if the policy says that a ≤ y, b ≤ y, and x ≤ y, then the information flow is secure. But if a ≤ y only when some other variable z = 1, the compiler-based mechanism must determine whether z = 1 before certifying the statement. Typically, this is infeasible. Hence, the compiler-based mechanism would not certify the statement. The mechanisms described here follow those developed by Denning and Denning [247] and Denning [242].

15.2.1 Declarations

For our discussion, we assume that the allowed flows are supplied to the checking mechanisms through some external means, such as from a file. The specifications of allowed flows involve security classes of language constructs. The program involves variables, so some language construct must relate variables to security classes. One way is to assign each variable to exactly one security class. We opt for a more liberal approach, in which the language constructs specify the set of classes from which information may flow into the variable. For example,

x: integer class { A, B }

states that x is an integer variable and that data from security classes A and B may flow into x. Note that the classes are statically, not dynamically, assigned. Viewing the security classes as a lattice, this means that x's class must be at least the least upper bound of classes A and B—that is, lub{A, B} ≤ x.

Two distinguished classes, Low and High, represent the greatest lower bound and least upper bound, respectively, of the lattice. All constants are of class Low.

Information can be passed into or out of a procedure through parameters. We classify parameters as input parameters (through which data is passed into the procedure), output parameters (through which data is passed out of the procedure), and input/output parameters (through which data is passed into and out of the procedure).

(* input parameters are named i_s; output parameters, o_s; *)
   (* and input/output parameters, io_s, with s a subscript *)
   proc something(i₁, ..., i_k; var o₁, ..., o_m, io₁, ..., io_n);
   var l₁, ..., l_j;                (* local variables *)
   begin
   S;                     (* body of procedure *)
   end;

The class of an input parameter is simply the class of the actual argument:

i_s: type class { i_s }

Let r₁, ..., r_p be the set of input and input/output variables from which information flows to the output variable o_s. The declaration for the type must capture this:

o_s: type class { r₁, ..., r_p }

(We implicitly assume that any output-only parameter is initialized in the procedure.) The input/output parameters are like output parameters, except that the initial value (as input) affects the allowed security classes. Again, let r₁, ..., r_p be defined as above. Then:

io_s: type class {r₁, ..., r_p, io₁, ..., io_k }

EXAMPLE: Consider the following procedure for adding two numbers.

proc sum(x: int class { x };
             var out: int class { x, out });
begin
        out := out + x;
end;

Here, we require that x ≤ out and out ≤ out (the latter holding because ≤ is reflexive).

The declarations presented so far deal only with basic types, such as integers, characters, floating point numbers, and so forth. Nonscalar types, such as arrays, records (structures), and variant records (unions) also contain information. The rules for information flow classes for these data types are built on the scalar types.

Consider the array

a: array 1 .. 100 of int;

First, look at information flows out of an element a[i] of the array. In this case, information flows from a[i] and from i, the latter by virtue of the index indicating which element of the array to use. Information flows into a[i] affect only the value in a[i], and so do not affect the information in i. Thus, for information flows from a[i], the class involved is lub{ a[i], i }; for information flows into a[i], the class involved is a[i].

15.2.2 Program Statements

A program consists of several types of statements. Typically, they are

Assignment statements
Compound statements
Conditional statements
Iterative statements
Goto statements
Procedure calls
Function calls
Input/output statements.

We consider each of these types of statements separately, with two exceptions. Function calls can be modeled as procedure calls by treating the return value of the function as an output parameter of the procedure. Input/output statements can be modeled as assignment statements in which the value is assigned to (or assigned from) a file. Hence, we do not consider function calls and input/output statements separately.

15.2.2.1 Assignment Statements

An assignment statement has the form

y := f(x₁, ..., x_n)

where y and x₁, ..., x_n are variables and f is some function of those variables. Information flows from each of the x_i's to y. Hence, the requirement for the information flow to be secure is

lub{x₁, ..., x_n} ≤ y

EXAMPLE: Consider the statement

x := y + z;

Then the requirement for the information flow to be secure is lub{ y, z } ≤ x.

15.2.2.2 Compound Statements

A compound statement has the form

begin
     S₁;
   ...
   S_n;
   end;

where each of the S_i's is a statement. If the information flow in each of the statements is secure, then the information flow in the compound statement is secure. Hence, the requirements for the information flow to be secure are

S₁ secure
...
S_n secure

EXAMPLE: Consider the statements

begin
     x := y + z;
     a := b * c - x;
end;

Then the requirements for the information flow to be secure are lub{ y, z } ≤ x for S₁ and lub{ b, c, x } ≤ a for S₂. So, the requirements for secure information flow are lub{ y, z } ≤ x and lub{ b, c, x } ≤ a.

15.2.2.3 Conditional Statements

A conditional statement has the form

if f(x₁, ..., x_n) then
   S₁;
   else
   S₂;
   end;

where x₁, . . , x_n are variables and f is some (boolean) function of those variables. Either S₁ or S₂ may be executed, depending on the value of f, so both must be secure. As discussed earlier, the selection of either S₁ or S₂ imparts information about the values of the variables x₁, ..., x_n, so information must be able to flow from those variables to any targets of assignments in S₁ and S₂. This is possible if and only if the lowest class of the targets dominates the highest class of the variables x₁, ..., x_n. Thus, the requirements for the information flow to be secure are

S₁ secure
S₂ secure
lub{x₁, ..., x_n} ≤ glb{ y | y is the target of an assignment in S₁ and S₂ }

As a degenerate case, if statement S₂ is empty, it is trivially secure and has no assignments.

EXAMPLE: Consider the statements

if x + y < z then
     a := b;
else
     d := b * c - x;
end;

Then the requirements for the information flow to be secure are b ≤ a for S₁ and lub{ b, c, x } ≤ d for S₂. But the statement that is executed depends on the values of x, y, and z. Hence, information also flows from x, y, and z to d and a. So, the requirements are lub{ y, z } ≤ x , b ≤ a, and lub{ x, y, z } ≤ glb{ a, d }.

15.2.2.4 Iterative Statements

An iterative statement has the form

while f(x₁, ..., x_n) do
   S;

where x₁, ..., x_n are variables and f is some (boolean) function of those variables. Aside from the repetition, this is a conditional statement, so the requirements for information flow to be secure for a conditional statement apply here.

To handle the repetition, first note that the number of repetitions causes information to flow only through assignments to variables in S. The number of repetitions is controlled by the values in the variables x₁, ..., x_n, so information flows from those variables to the targets of assignments in S—but this is detected by the requirements for information flow of conditional statements.

However, if the program never leaves the iterative statement, statements after the loop will never be executed. In this case, information has flowed from the variables x₁, ..., x_n by the absence of execution. Hence, secure information flow also requires that the loop terminate.

Thus, the requirements for the information flow to be secure are

Iterative statement terminates
S secure
lub{x₁, ..., x_n} ≤ glb{ y | y is the target of an assignment in S }

EXAMPLE: Consider the statements

while i < n do
begin
     a[i] := b[i];
     i := i + 1;
end;

This loop terminates. If n ≤ i initially, the loop is never entered. If i < n, i is incremented by a positive integer, 1, and so increases, at each iteration. Hence, after n – i iterations, n = i, and the loop terminates.

Now consider the compound statement that makes up the body of the loop. The first statement is secure if i ≤ a[i] and b[i] ≤ a[i]; the second statement is secure because i ≤ i. Hence, the compound statement is secure if lub{ i, b[i] } ≤ a[i].

Finally, a[i] and i are targets of assignments in the body of the loop. Hence, information flows into them from the variables in the expression in the while statement. So, lub{ i, n } ≤ glb{ a[i], i }. Putting these together, the requirement for the information flow to be secure is lub{ b[i], i, n } ≤ glb{ a[i], i } (see Exercise 2).

15.2.2.5 Goto Statements

A goto statement contains no assignments, so no explicit flows of information occur. Implicit flows may occur; analysis detects these flows.

Definition 15–4. A basic block is a sequence of statements in a program that has one entry point and one exit point.

EXAMPLE: Consider the following code fragment.

proc transmatrix(x: array [1..10][1..10] of int class { x };
             var y: array [1..10][1..10] of int class { y } );
     var i, j: int class { tmp };
begin
    i := 1;                        (* b₁ *)
      ________________________________________________________________
      l2: if i > 10 goto l7;             (* b₂ *)
      ________________________________________________________________
      j := 1;                        (* b₃ *)
      ________________________________________________________________
      l4: if j > 10 then goto l6;        (* b₄ *)
      ________________________________________________________________
      y[j][i] := x[i][j];          (* b₅ *)
      j := j + 1;
      goto l4;
      ________________________________________________________________
      l6: i := i + 1;                  (* b₆ *)
      goto l2;
      ________________________________________________________________
      l7:                              (* b₇ *)
      end;

There are seven basic blocks, labeled b₁ through b₇ and separated by lines. The second and fourth blocks have two ways to arrive at the entry—either from a jump to the label or from the previous line. They also have two ways to exit—either by the branch or by falling through to the next line. The fifth block has three lines and always ends with a branch. The sixth block has two lines and can be entered either from a jump to the label or from the previous line. The last block is always entered by a jump.

Control within a basic block flows from the first line to the last. Analyzing the flow of control within a program is therefore equivalent to analyzing the flow of control among the program's basic blocks. Figure 15-1 shows the flow of control among the basic blocks of the body of the procedure transmatrix.

Figure 15-1 The control flow graph of the procedure transmatrix. The basic blocks are labeled b1 through b7.The conditions under which branches are taken are shown over the edges corresponding to the branches.

When a basic block has two exit paths, the block reveals information implicitly by the path along which control flows. When these paths converge later in the program, the (implicit) information flow derived from the exit path from the basic block becomes either explicit (through an assignment) or irrelevant. Hence, the class of the expression that causes a particular execution path to be selected affects the required classes of the blocks along the path up to the block at which the divergent paths converge.

Definition 15–5. An immediate forward dominator of a basic block b (written IFD(b)) is the first block that lies on all paths of execution that pass through b.

EXAMPLE: In the procedure transmatrix, the immediate forward dominators of each block are IFD(b₁) = b₂, IFD(b₂) = b₇, IFD(b₃) = b₄, IFD(b₄) = b₆, IFD(b₅) = b₄, and IFD(b₆) = b₂.

Computing the information flow requirement for the set of blocks along the path is now simply applying the logic for the conditional statement. Each block along the path is taken because of the value of an expression. Information flows from the variables of the expression into the set of variables assigned in the blocks. Let B_i be the set of blocks along an execution path from b_i to IFD(b_i), but excluding these endpoints. (See Exercise 3.) Let x_i₁, ..., x_in be the set of variables in the expression that selects the execution path containing the blocks in B_i. The requirements for the program's information flows to be secure are

All statements in each basic block secure
lub{x_i₁, ..., x_in} ≤ glb{ y | y is the target of an assignment in B_i }

EXAMPLE: Consider the body of the procedure transmatrix. We first state requirements for information flow within each basic block:

b₁: Low ≤ i ⇒ secure
b₃: Low ≤ j ⇒ secure
b₅: lub{ x[i][j], i, j } ≤ y[j][i]; j ≤ j ⇒ lub{ x[i][j], i, j } ≤ y[j][i]
b₆: lub{ Low, i } ≤ i ⇒ secure

The requirement for the statements in each basic block to be secure is, for i = 1, ..., n and j = 1, ..., n, lub{ x[i][j], i, j } ≤ y[j][i]. By the declarations, this is true when lub{x, i} ≤ y.

In this procedure, B₂ = { b₃, b₄, b₅, b₆ } and B₄ = { b₅ }. Thus, in B₂, statements assign values to i, j, and y[j][i]. In B₄, statements assign values to j and y[j][i]. The expression controlling which basic blocks in B₂ are executed is i ≤ 10; the expression controlling which basic blocks in B₄ are executed is j ≤ 10. Secure information flow requires that i ≤ glb{ i, j, y[j][i]} and j ≤ glb{ j, y[j][i] }. In other words, i ≤ glb{ i, y } and i ≤ glb{ i, y }, or i ≤ y.

Combining these requirements, the requirement for the body of the procedure to be secure with respect to information flow is lub{x, i} ≤ y.

15.2.2.6 Procedure Calls

A procedure call has the form

proc procname(i₁, ..., i_m : int; var o₁, ..., o_n : int);
   begin
   S;
   end;

where each of the i_j's is an input parameter and each of the o_j's is an input/output parameter. The information flow in the body S must be secure. As discussed earlier, information flow relationships may also exist between the input parameters and the output parameters. If so, these relationships are necessary for S to be secure. The actual parameters (those variables supplied in the call to the procedure) must also satisfy these relationships for the call to be secure. Let x₁, ..., x_m and y₁, ..., y_n be the actual input and input/output parameters, respectively. The requirements for the information flow to be secure are

S secure
For j = 1, ..., m and k = 1, ..., n, if i_j ≤ o_k then x_j ≤ y_k
For j = 1, ..., n and k = 1, ..., n, if o_j ≤ o_k then y_j ≤ y_k

EXAMPLE: Consider the procedure transmatrix from the preceding section. As we showed there, the body of the procedure is secure with respect to information flow when lub{x, tmp} ≤ y. This indicates that the formal parameters x and y have the information flow relationship x ≤ y. Now, suppose a program contains the call

transmatrix(a, b)

The second condition asserts that this call is secure with respect to information flow if and only if a ≤ b.

15.2.3 Exceptions and Infinite Loops

Exceptions can cause information to flow.

EXAMPLE: Consider the following procedure, which copies the (approximate) value of x to y.^[1]

proc copy(x: int class { x }; var y: int class Low);
var  sum: int class { x };
     z: int class Low;
begin
     z := 0;
     sum := 0;
     y := 0;
     while z = 0 do begin
           sum := sum + x;
           y := y + 1;
     end
end

When sum overflows, a trap occurs. If the trap is not handled, the procedure exits. The value of x is MAXINT / y, where MAXINT is the largest integer representable as an int on the system. At no point, however, is the flow relationship x ≤ y checked.

If exceptions are handled explicitly, the compiler can detect problems such as this. Denning again supplies such a solution.

EXAMPLE: Suppose the system ignores all exceptions unless the programmer specifically handles them. Ignoring the exception in the preceding example would cause the program to loop indefinitely. So, the programmer would want the loop to terminate when the exception occurred. The following line does this.

on overflowexception sum do z := 1;

This line causes information to flow from sum to z, meaning that sum ≤ z. Because z is Low and sum is { x }, this is incorrect and the procedure is not secure with respect to information flow.

Denning also notes that infinite loops can cause information to flow in unexpected ways.

EXAMPLE: The following procedure copies data from x to y. It assumes that x and y are either 0 or 1.

proc copy(x: int 0..1 class { x };
            var y: int 0..1 class Low);
begin
     y := 0;
     while x = 0 do
          (* nothing *);
     y := 1;
end.

If x is 0 initially, the procedure does not terminate. If x is 1, it does terminate, with y being 1. At no time is there an explicit flow from x to y. This is an example of a covert channel, which we will discuss in detail in the next chapter.

15.2.4 Concurrency

Of the many concurrency control mechanisms that are available, we choose to study information flow using semaphores [270]. Their operation is simple, and they can be used to express many higher-level constructs [135, 718]. The specific semaphore constructs are

wait(x): if x = 0 then block until x > 0; x := x - 1;
signal(x): x := x + 1;

where x is a semaphore. As usual, the wait and the signal are indivisible; once either one has started, no other instruction will execute until the wait or signal finishes.

Reitman and his colleagues [33, 748] point out that concurrent mechanisms add information flows when values common to multiple processes cause specific actions. For example, in the block

begin
     wait(sem);
     x := x + 1;
end;

the program blocks at the wait if sem is 0, and executes the next statement when sem is nonzero. The earlier certification requirement for compound statements is not sufficient because of the implied flow between sem and x. The certification requirements must take flows among local and shared variables (semaphores) into account.

Let the block be

begin
     S₁;
   ...
   S_n;
   end;

Assume that each of the statements S₁, ..., S_n is certified. Semaphores in the signal do not affect information flow in the program in which the signal occurs, because the signal statement does not block. But following a wait statement, which may block, information implicitly flows from the semaphore in the wait to the targets of successive assignments.

Let statement S_i be a wait statement, and let shared(S_i) be the set of shared variables that are read (so information flows from them). Let g(S_i) be the greatest lower bound of the targets of assignments following S_i. A requirement that the block be secure is that shared(S_i) ≤ g(S_i). Thus, the requirements for certification of a compound statement with concurrent constructs are

S₁ secure
...
S_n secure
For i = 1, ..., n [ shared(S_i) ≤ g(S_i) ]

EXAMPLE: Consider the statements

begin
     x := y + z;
     wait(sem);
     a := b * c - x;
end;

The requirements that the information flow be secure are lub{ y, z } ≤ x for S₁ and lub{ b, c, x } ≤ a for S₂. Information flows implicitly from sem to a, so sem ≤ a. The requirements for certification are lub{ y, z } ≤ x, lub{ b, c, x } ≤ a, and sem ≤ a.

Loops are handled similarly. The only difference is in the last requirement, because after completion of one iteration of the loop, control may return to the beginning of the loop. Hence, a semaphore may affect assignments that precede the wait statement in which the semaphore is used. This simplifies the last condition in the compound statement requirement considerably. Information must be able to flow from all shared variables named in the loop to the targets of all assignments. Let shared(S_i) be the set of shared variables read, and let t₁, ..., t_m be the targets of assignments in the loop. Then the certification conditions for the iterative statement

while f(x₁, ..., x_n) do
   S;

are

Iterative statement terminates
S secure
lub{x₁, ..., x_n} ≤ glb{ t₁, ..., t_m }
lub{shared(S₁), ,,,, shared(S_n) } ≤ glb{ t₁, ..., t_m }

EXAMPLE: Consider the statements

while i < n do
begin
     a[i] := item;
     wait(sem);
     i := i + 1;
end;

Now consider the compound statement that makes up the body of the loop. The first statement is secure if i ≤ a[i] and item ≤ a[i].The third statement is secure because i ≤ i. The second statement induces an implicit flow, so sem ≤ a[i] and sem ≤ i. The requirements are thus i ≤ a[i], item ≤ a[i], sem ≤ a[i], and sem ≤ i.

Finally, concurrent statements have no information flow among them per se. Any such flows occur because of semaphores and involve compound statements (discussed above). The certification conditions for the concurrent statement

cobegin
     S₁;
   ...
   S_n;
   coend;

are

S₁ secure
...
S_n secure

EXAMPLE: Consider the statements

cobegin
     x := y + z;
     a := b * c - y;
coend;

The requirements that the information flow be secure are lub{ y, z } ≤ x for S₁ and lub{ b, c, y } ≤ a for S₂. The requirement for certification is simply that both of these requirements hold.

15.2.5 Soundness

Denning and Denning [247], Andrews and Reitman [33], and others build their argument for security on the intuition that combining secure information flows produces a secure information flow, for some security policy. However, they never formally prove this intuition. Volpano, Irvine, and Smith [920] express the semantics of the above-mentioned information on flow analysis as a set of types, and equate certification that a certain flow can occur to the correct use of types. In this context, checking for valid information flows is equivalent to checking that variable and expression types conform to the semantics imposed by the security policy.

Let x and y be two variables in the program. Let x's label dominate y's label. A set of information flow rules is sound if the value in x cannot affect the value in y during the execution of the program. Volpano, Irvine, and Smith use language-based techniques to prove that, given a type system equivalent to the certification rules discussed above, all programs without type errors have the noninterference property described above. Hence, the information flow certification rules of Denning and of Andrews and Reitman are sound.

< Back Page 2 of 6 Next >

🔖 Save To Your Account

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.

Email Address

Information Flow

This chapter is from the book

This chapter is from the book

This chapter is from the book 

15.2 Compiler-Based Mechanisms

15.2.1 Declarations

15.2.2 Program Statements

15.2.2.1 Assignment Statements

15.2.2.2 Compound Statements

15.2.2.3 Conditional Statements

15.2.2.4 Iterative Statements

15.2.2.5 Goto Statements

15.2.2.6 Procedure Calls

15.2.3 Exceptions and Infinite Loops

15.2.4 Concurrency

15.2.5 Soundness

InformIT Promotional Mailings & Special Offers