Scheduling Checks and Saves

Leonid B. Boguslavsky
Leonid B. Boguslavsky
Institute of Control Sciences, GSP-312, Moscow, Russia
Search for more papers by this author
,
Edward G. Coffman, Jr.
Edward G. Coffman, Jr.
AT&T Bell Laboratories, Murray Hill, NJ 07974
Search for more papers by this author
,
Edgar N. Gilbert
Edgar N. Gilbert
AT&T Bell Laboratories, Murray Hill, NJ 07974
Search for more papers by this author
,
Alexander Y. Kreinin
Alexander Y. Kreinin
Institute of Control Sciences, GSP-312, Moscow, Russia
Search for more papers by this author

Leonid B. Boguslavsky

Institute of Control Sciences, GSP-312, Moscow, Russia

Search for more papers by this author

Edward G. Coffman, Jr.

AT&T Bell Laboratories, Murray Hill, NJ 07974

Search for more papers by this author

Edgar N. Gilbert

AT&T Bell Laboratories, Murray Hill, NJ 07974

Search for more papers by this author

Alexander Y. Kreinin

Institute of Control Sciences, GSP-312, Moscow, Russia

Search for more papers by this author

Published Online:1 Feb 1992https://doi.org/10.1287/ijoc.4.1.60

Abstract

A job is to be run on a machine subject to random failures. Failures are not self-evident. They must be detected by explicit tests, or checks. Checks detect failures to avoid wasting time working with a defective machine. After each successful check one has the option of saving the work just completed. Then, when failures occur, only the work done since the last save must be repeated. Effective use of checks and saves requires a compromise, since these procedures are themselves time-consuming. Scheduling saves alone, when failures are evident as soon as they occur, is often called checkpointing. The novelty of the model studied here stems from not assuming that failures are self-evident. This compounds the usual checkpointing problem by requiring schedules of failure checks as well as saves. This paper gives schedules of checks and saves that minimize the expected total time required to complete a given job. The most general failure mechanism considered is a renewal process. The Poisson process receives special emphasis, as it leads to the simplest results.

INFORMS Journal on Computing, ISSN 1091-9856, was published as ORSA Journal on Computing from 1989 to 1995 under ISSN 0899-1499.

Volume 4, Issue 1

Winter 1992

Pages 1-99

Article Information

Metrics

Information

Published Online:February 01, 1992

Cite as

Leonid B. Boguslavsky, Edward G. Coffman, Jr., Edgar N. Gilbert, Alexander Y. Kreinin, (1992) Scheduling Checks and Saves. ORSA Journal on Computing 4(1):60-69.

https://doi.org/10.1287/ijoc.4.1.60

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Scheduling Checks and Saves

Abstract

Volume 4, Issue 1

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News