Processing math: 100%

RustEmet1987.ox

Replicate Rust (Emet 1987) Engine Replacement Model using DDP.

RustEmet1987.ox replicates Table IX, Column 2, Rows 1 and 2 in Rust (Econometrica 1987), the celebrated bus engine replacement model.

Skip down to documentation of items defined in DP.ox

Overview
Model
Implementation
Remarks
Replication Output
Simulation

Overview

The program produces engine replacement probabilities that match those shown in Figure 3 on page 1025 of the paper. The code segments are not exactly the same as in the Ox file, which includes additional code beyond the basics shown here.

The paper models the decision to replace a bus engine as a function of mileage and unobserved IID shocks. Replacement is a one-time cost that resets the odometer to x=0. The cost of monthly maintenance increases linearly with x.

See RustEmet1987mle to see estimation using data as in the original paper.

The Model

An example of Rust class DDP:

$\zeta$ specification: ex ante extreme value shocks with

$\rho = 1.0.$

Clock: Ergodic (infinite horizon with no terminal or absorbing states)

Action:

$\alpha = (d)$ , d.N = 2, the binary decision to replace the engine or not.

Endogenous State

$\theta = (x),$ x.N = 90.

x is the mileage reading on the bus at its monthly maintenance, grouped into 5000 mile categories.

Transition: mileage next month is 0, 1 or 2 mileage bins more.

    x' = (1-d)x    w/ prob. θ₃₁
         (1-d)x +1 w/ prob. θ₃₂
         (1-d)x +2 w/ prob. 1-θ₃₁-θ₃₂

If x = 88 or 89 then the increment probabilities changed accordingly.

Semi-Endogenous: none

Endogenous: none

Utility:

The cost of regular maintenance or engine replacement.

U = -[ dRC + (1-d)θ₁mx ] + n + z_d    (5.1) p. 1015

z_d is the standard IID extreme value shock. (In the paper it is denoted ε)

Parameters:

Parameter	Row 1	Row 2
δ	0.9999;	0
RC	10.07	7.6538
θ₁	2.293	71.5133
θ₃	< 0.3919,0.5953>	< 0.3919,.5953>

The article suggests x enters the code in terms of thousands of miles (0 to 445,000), but I was not able to replicate results based on that. I was able to replicate replacement probabilities only when x takes on values between 0 and 89.

RC is the bus engine replacement cost. θ₁ is the operating cost given mileage x. The parameter m=0.001 is a scaling factor, which appears to rescale odometer readings. However, results were replicated when x is the bin number (0 to 89) not the odometer category.

Normalization

The paper refers to a normalization when displaying V() (see Figure 2, p. 1025). The normalization is not explained, but my calculations seem to explain why it is necessary. The closed form extreme-value value function iteration involves taking exponents of negative values (operating costs). When δ=0.9999 and n=0 above the result is numerical overflow in exp() With n>0 set properly the arguments are kept closer to 0. In the replication the normalization n is set to the maintenance cost for x=89/2, the median mileage category.

Implementation in DDP

Declarations in the Header File

Header File

Executable Code in the Ox File

Ox File

Remarks

Zurcher is derived from the Rust class

code>Rust class embeds an infinite horizon clock and a single binary action (d). It defines constants and declares a static member x>. Zurcher then declares the required method Utility().

Utility must have that name and cannot be static.

The form of the expression returned as the value should look related to the mathematical expression U() above. The parameters are static members of the class (they are the shared by each state θ). The current value of x is accessed as CV(x). The value of x is set to be the correct value at θ before Utility() is called.

Utility must return a column vector of values corresponding to the feasible action vector A[Aind]. In this case the action variable is simply d, and engine replacement is always feasible.

In mathematical notation, we would write the choice set as

$\alpha \in A(\theta)$ . Rather than accessing that directly, the column of values for d is accessed using CV(d). This makes the code closer to the notation and more general since it will work even if other actions are added to the model.

The tasks required to set up and solve the model are in Zurcher::Run(). Like Run(), a user's program must do these tasks in order:

Call Initialize(new Zurcher()) for the parent DDP class. Since Zurcher is of type Rust, the program is calling Rust::Initialize(), which will set the clock and add the binary action variable to the model, d.
Create state variables (using the new operator) and add them to the model.
Call CreateSpaces();.

The state variable x is of type Renewal, which captures the process in Rust (1987) but is more general than in the empirical specification.

To properly capture the transition it must be coordinated with the choice variable d. To do this d is passed to new Renewal() as an argument, along with how many categories x takes on and the vector of increment probabilities. The length of the vector determines how different states are feasible next period. In the empirical work, 3 of 90 odometer readings are feasible next period.

Once CreateSpaces() has been called new actions and new state variables cannot be added to the model because the state spaces have be set up already.

To solve the model the following steps are carried out

Set the discount factor δ. (This can be done at any time after Initialize() before solving the model with that δ.)
Create a new instance of a solution Method, such as value function iteration.
Call Solve() for the solution method to compute $V(\theta)$ and $P^\star(\alpha|\theta)$ .
Use the results, then if necessary change some settings or parameters and solve again.

The code inside the for() loop solves the model twice, for Row 1 and Row 2 parameters.

For each set of parameters the value function is solved through Bellman iteration by Emax->Solve(). It calls the Output() routine to capture results to be printed and compared with Figure 3 in the paper.

Output

       Output produced: niqlow/examples/output/RustEmet1987.txt

The values of EV do not match those shown in Figure 2 of the paper. This is due to Rust using a different normalization than the value used here. However, the choice probabilities appear to match very well those shown if Figure 3, as the image below suggests.

Comparison to Figure 3

The paper labels the x axis as thousands of miles since engine replacement. This means that x should take on values 0, 5, ..., 445. It also suggests that the factor on costs scales 300,000 to 300. However, using this scaling I was not able to replicate the choice probabilities. Only when x takes on values 0, ..., 89 and the scaling factor still equals 0.001 was I able to reproduce the choice probabilities.

Simulation

Another use of the model is to simulate data by creating a Panel object and simulating a sample of data from the process.

The call to Panel::Simulate creates a data set of 10 buses observed over 400 months each. The initial mileage for each bus will be drawn from the ergodic distribution.

The output file shows a simulated panel of buses, including the months and mileage at every replacement. The simulate routine returns a panel of observations.

But in the case of this model it is sufficient to show only months when an engine is replaced, the age of the bus when it occurs and the mileage reading. So the replication program deletes simulated months when no replacement occurred. It also appends the row number for the model.

For example, the first simulated bus data looks like this

Simulated data
Simulated data
  Fxed  path     t     n     T   Aj|   s11   s21     x     t    t'     r     f  |ai|     a
     0     0    92    50     0     0     0     0    50     0     0     0     0     1     1
     0     0   164    44     0     0     0     0    44     0     0     0     0     1     1
     0     0   248    53     0     0     0     0    53     0     0     0     0     1     1
     0     0   344    69     0     0     0     0    69     0     0     0     0     1     1

Bus 0 had four replacements, which occurred in months 92, 164, 248 and 344 of its lifetime. The mileage on the odometer at replacement was in the interval [5000x,5000(x+1)].

The column labeled Const is a placeholder for exogenous state variables.

If the model has no such variables then DDP inserts a constant there. This makes it possible to construct indices into the state space without requiring a check of zero dimensions.

Since the model is stationary there is no separate time variable. If this were a finite horizon or non-stationary environment then t would contain the age of the process.

DDP allows for random aging so t would not necessarily be equivalent to the months column.

The t' variable (t″ in the notes) is used during iteration on the value function. During simulation it is not used and would always have the value 0.

Author:: © 2011-2018 Christopher Ferrall

Zurcher

Public fields
bins	static const	Number of bins by table.
COL	static	column of Table
dfactor	static const	Discount factor by row.
DMETH	static	Discretization method: Month or Mileage
mfact	static const	scaling on cost
normalization	static	added to U() to avoid underflow
NX	static	# of bins, 90 or 175
parlist	static const	2x3x2x4 array of reported parameter estimates.
pars	static	parameter vector $\psi$ .
rc	static	value of RC
ROW	static	row of table/column.
th1	static	value of $\theta_1$
x	static	mileage state, x
Public methods
Output	static
Run	static	Setup and solve the model for each discount factor and generate the figure.
SetSpec	static	Set the target of the replication.
Utility		Computes the one period linear cost as 2x1 vector $U = dRC+(1-d)\theta_1 x + n.$
Enumerations
RNpars		tags for table/column/row choices.
Zparams		tags for estimated parameters.

Zurcher

Enumerations
RNpars	tags for table/column/row choices. MonthMethod: 0 for Replacment Month 1 Replacement Mileage 2 Rust ceil(). Table: either 0 or 1 for Table IX or X (N=90 or N=175) Column: 0, 1, or 2 for the column of the table (which bus groups to include) Row: either 0 or 1 for δ=.9999 or 0	MonthMethod, Table, Column, Row, Target
Zparams	tags for estimated parameters.	RC, theta1, theta3, LnLk, SEs, Nparams

bins

static const decl bins [public]

Number of bins by table.

COL

static decl COL [public]

column of Table

dfactor

static const decl dfactor [public]

Discount factor by row.

DMETH

static decl DMETH [public]

Discretization method: Month or Mileage

mfact

static const decl mfact [public]

scaling on cost

normalization

static decl normalization [public]

added to U() to avoid underflow

NX

static decl NX [public]

# of bins, 90 or 175

Output

static Zurcher :: Output ( chprob )

parlist

static const decl parlist [public]

2x3x2x4 array of reported parameter estimates.

pars

static decl pars [public]

parameter vector

$\psi$ .

rc

static decl rc [public]

value of RC

ROW

static decl ROW [public]

row of table/column.

Run

static Zurcher :: Run ( targ )

Setup and solve the model for each discount factor and generate the figure.

SetSpec

static Zurcher :: SetSpec ( targ )

Set the target of the replication.

This allows the basic code to be re-used for different specifications.

Parameters:

targ	is array of integer values for the Table, COlumn and Row from the original paper.

th1

static decl th1 [public]

value of

$\theta_1$

Utility

Zurcher :: Utility ( )

Computes the one period linear cost as 2x1 vector

$U = dRC+(1-d)\theta_1 x + n.$

x

static decl x [public]

mileage state, x

RustEmet1987.ox

Documentation of Items Defined RustEmet1987.ox

Zurcher

Zurcher

bins

COL

dfactor

DMETH

mfact

normalization

NX

Output

parlist

pars

rc

ROW

Run

SetSpec

th1

Utility

x