More Details About Hard Drives

Let's briefly look at a few more aspects of how hard drives operate.

We'll cover the following

Though we won’t spend too much time on it, there are some other interesting details about how hard drives operate.

Track skew

Many drives employ some kind of track skew to make sure that sequential reads can be properly serviced even when crossing track boundaries. In our simple example disk, this might appear as seen in the figure below.

Sectors are often skewed like this because when switching from one track to another, the disk needs time to reposition the head (even to neighboring tracks). Without such skew, the head would be moved to the next track but the desired next block would have already rotated under the head, and thus the drive would have to wait almost the entire rotational delay to access the next block.

Multi-zoned disk drives

Another reality is that outer tracks tend to have more sectors than inner tracks, which is a result of geometry; there is simply more room out there. These tracks are often referred to as multi-zoned disk drives, where the disk is organized into multiple zones, and where a zone is consecutive set of tracks on a surface. Each zone has the same number of sectors per track, and outer zones have more sectors than inner zones.

Cache

Finally, an important part of any modern disk drive is its cache, for historical reasons sometimes called a track buffer. This cache is just some small amount of memory, usually around 8 or 16 MB, which the drive can use to hold data read from or written to the disk. For example, when reading a sector from the disk, the drive might decide to read in all of the sectors on that track and cache them in its memory; doing so allows the drive to quickly respond to any subsequent requests to the same track.

On writes, the drive has a choice: should it acknowledge the write has completed when it has put the data in its memory, or after the write has actually been written to disk? The former is called write back caching (or sometimes immediate reporting), and the latter write through. Write back caching sometimes makes the drive appear “faster”, but can be dangerous; if the file system or applications require that data be written to disk in a certain order for correctness, write-back caching can lead to problems (read the chapter on file-system journaling for details).

ASIDE: DIMENSIONAL ANALYSIS

Remember in Chemistry class, how you solved virtually every problem by simply setting up the units such that they canceled out, and somehow the answers popped out as a result? That chemical magic is known by the highfalutin name of dimensional analysis and it turns out it is useful in computer systems analysis too.

Let’s do an example to see how dimensional analysis works and why it is useful. In this case, assume you have to figure out how long, in milliseconds, a single rotation of a disk takes. Unfortunately, you are given only the RPM of the disk, or rotations per minute. Let’s assume we’re talking about a 10K RPM disk (i.e., it rotates 10,000 times per minute). How do we set up the dimensional analysis so that we get time per rotation in milliseconds?

To do so, we start by putting the desired units on the left; in this case, we wish to obtain the time (in milliseconds) per rotation, so that is exactly what we write down: Time(ms)1Rotation\frac{Time\:(ms)}{1\: Rotation}. We then write down everything we know, making sure to cancel units where possible. First, we obtain 1minute10,000Rotations\frac{1\: minute}{10,000\: Rotations} (keeping rotation on the bottom, as that’s where it is on the left), then transform minutes into seconds with 60seconds1minute\frac{60\: seconds}{1\: minute}, and then finally transform seconds in milliseconds with 1000ms1second\frac{1000\: ms}{1\: second}. The final result is the following (with units nicely canceled):

Time(ms)1Rot.=1minute10,000Rot..60seconds1minute.1000ms1second=60,000ms10,000Rot.=6msRotation\frac{Time\:(ms)}{1\: Rot.}=\frac{1\: minute}{10,000\: Rot.}.\frac{60\: seconds}{1\: minute}.\frac{1000\: ms}{1\: second}=\frac{60,000\: ms}{10,000\: Rot.}=\frac{6\: ms}{Rotation}

As you can see from this example, the dimensional analysis makes what seems intuitive into a simple and repeatable process. Beyond the RPM calculation above, it comes in handy with I/O analysis regularly. For example, you will often be given the transfer rate of a disk, e.g., 100 MB/second, and then asked: how long does it take to transfer a 512 KB block (in milliseconds)? With dimensional analysis, it’s easy:

Time(ms)1Request=512KB1Request.1MB1024KB.1second100MB.1000ms1second=5msRequest\frac{Time\:(ms)}{1\: Request}=\frac{512\: KB}{1\: Request}.\frac{1\: MB}{1024\: KB}.\frac{1\: second}{100\: MB}.\frac{1000\: ms}{1\: second}=\frac{5\: ms}{Request}

Get hands-on with 1400+ tech skills courses.