Arrowmail Logo




Also On
This Page


 

 

The Storage Tree - Understanding the Hierarchy of How Files are stored on PCs


I often come across PC users who store the files they create on either their Desktop or in
My Documents as those are the only places they feel confident of being able to locate them again.
If a CD ROM doesn't "autoplay" when they close the drive door, or a USB Flash drive doesn't automatically open a file listing window, then they're stuck.
They know how to download a file from a website but where it disappears to after it's downloaded is a mystery.
Making backups, organising files more logically and conveniently or deleting files are way too hard.

Many, more experienced, users still only have a vague understanding of what the options are for storing files and how all the various locations are interconnected to form the overall Storage Tree.

So, on this webpage, we've tried to provide the explanation that every new PC user should receive before starting to work with files.
No matter how user-friendly or intuitive a computer system may be, dealing with files is an inescapable part of what you'll need to do to keep your important data safe and accessible.



Getting the Basics out of the Way

What is a file?
A file is a group of 8 bit binary numbers which, when taken together and in the correct order, form a unit of information such as a Word document, a picture, a video or a piece of software.
For such an Information Unit to be a file it also needs to be stored on a medium which has a degree of permanence so that it will continue to exist after the storage device containing the storage medium is turned off.

An Information Unit stored in a computer's main Random Access Memory or in transit across the Internet isn't a file.
A file also needs to be able to be read from its location either into computer memory or onto another storage device. This requirement means that a picture displayed on a computer monitor isn't a file and the paper printout of all the binary number contained in a file also isn't a file.

A file can be any length from zero (8-bit) bytes to many gigabytes. The maximum size that a file can be is limited by the computer's operating system and the capability of the storage device.
This limit is gradually being extended upwards as a practical use is found for larger files.
It's worth pointing out that even a zero-length file will use up over 4000 bytes of hard drive storage, as storage devices allocate storage space in fixed sized chunks.
A high quality video file may be 4gb which can be stored on a DVD.
A file that holds the "image" of a hard drive can be many 100s of gigabytes which can only be stored on another hard drive or perhaps a magnetic tape.

A file also has "attributes" which are a combination of special file properties and data that describes the file. Data that describes data is usually called as metadata.
These attributes are usually stored separately from the file in a storage medium's "table of contents". Such attributes include File name, file size, time and date of last modification, read-only, compressed or encrypted.
When a file's read-only attribute is set, this is just a signal to the operating system not to allow the file's contents to be changed.
If a file's "compressed" or "encrypted" attributes are set then the whole contents of the file will also need to replaced by new data that is related to the original data by some mathematical formula that either makes the new data occupy less storage space than the original or makes it unreadable unless a "key" is provided to allow the mathematical process to be reversed.
The compressed and encrypted attributes are unusual in that they actually affect the data itself. All file systems I know of, allow files to be encrypted or compressed, but not both at the same time.
A Windows XP Explorer window displays a file in blue to indicate it's been compression and green to indicate it's been encrypted.


A new installation of Windows XP consists of around 30,000 files and, after many months of use, this can have grown to 300,000 or more.
Where have all these files come from?

Here's an example of what these files might consist of and how they've got onto your PC:-


 

File Category

Source

40%

Windows operating system and associated files
System files
Configuration files
Driver files
Log files
Help files

Windows installation CD and Windows Update Website

30%

Software programs you have installed

Program installation CD or installation files, downloaded from manufacturer's website

15%

Files you have downloaded
MP3s
Videos
Digital camera pictures

iTunes website,
Peer-to-peer networks,
Your digital camera

10%

Files created as an indirect result of your actions
A web browser's cache of  the web pages you've visited

Downloaded from the Internet or a temporary file created by a program for its own use

5%

Files you have actually created yourself
PowerPoint presentations
Budget in Excel
Home movie

Created in applications such as Microsoft Word


I'm not sure what category a virus or spyware fits into, but let's assume you don't have any on your PC.

What is a folder?
A folder is a storage location on a storage device that can contain files just like a real cardboard or plastic folder can contain pieces of paper.
A file-system folder can also contain other folders that can themselves contain other folders and this is where the analogy to real folders breaks down.

The purpose of folders is to organise files into logical groupings so that humans can more easily find one particular file and so make sense of the large amount of files, in total, on a storage device such as a hard drive.
Compartmentalising objects is a common strategy that humans use to organise and control large numbers of objects:-
Fresh food in the fridge, frozen in the freezer, cereals and packets in the pantry cutlery in the cutlery drawer, cleaning materials in under the sink, coffee mugs in the cupboard above the kettle; this way, we can quickly find any object we need in the kitchen.

Like real folders, a file system folder has a name and a location:-
Store-room 2 - Filing Cabinet 4 - 3rd drawer down - at the back - "Invoices April 2007"
A file system folder's location is its position in the hierarchy of other folders on the storage device.

A file-system folder also has attributes for instance it may be hidden and although you can set a folder to be encrypted, compressed and set access permissions to control who can do what with it, it's really just the files contained within the folder that are compressed, encrypted etc. rather than the folder itself.
If an unencrypted file is copied to an encrypted folder then the copy is automatically encrypted.

A folder only "exists" as an entry in the Table of Contents of a storage device.
If you were to examine how the files themselves are distributed throughout the storage device, there would be no folder structure apparent.

Folders, being a device to help humans organise files, are far less important than files and the only real data they contain is their name.

We generally create files and folders on a hard drive without considering that there are finite limits to the number and size of the files that can be stored in a folder and the number of times you can place a folder inside a folder inside a folder.
Fortunately drive manufacturers and operating systems keep extending these limits so that it's unlikely that we will ever manage to exceed the current limits that apply to the system we're using.
The only limit I've come up against in recent times is a CD writing program warning me of the limit of 8 nested folders allowed on a CD.

In storage devices, it's OK to have multiple files and multiple folders with the same name as long as they're not in the same location in the file system hierarchy.
A folder also cannot contain a file and a folder with the same name as each other.
In any case, it's a good idea to avoid having multiple files and folders with the same names.

The top of the folder hierarchy in a storage device is called the top-level folder or root folder, it's the only folder that isn't a sub-folder.
The folder above the current folder can be called the Parent Folder.

Directory is a term that means exactly the same as Folder in file storage systems but has dropped out of use in recent times however you may still hear the term, especially Root Directory for the top-level folder.

Where can files be stored?
The "Filing Cabinet" metaphor
What's all this about trees?

The way that folders are created and the way that files are stored in them can be compared to how a tree grows where braches split off other branches and leaves grow.
The root system of a tree has a similar shape to it's branches and so some people take the
file-system tree metaphor to mean a tree's root system.
I think it makes more sense to view it as the branches of a tree only turned upside down because the root system has no equivalent to leaves which represent files.
Upside-down branches or roots, use whichever analogy makes most sense for you.

Why Microsoft doesn't help.


The Bent Up-Arrow and Breadcrumbs

When you are viewing an Explorer window, displaying files and folders at some location in your computer's Storage Tree, you are looking downwards, deeper into the storage tree, as if floating face-down on the ceiling of the current folder, and so you can't see what's above you.
Even the folder immediately above your present location is not visible.
It's easy to move down deeper into the storage tree by double-clicking on a folder but moving upwards is harder.
That's what the bent up-arrow icon does, it moves you up one level in the storage tree.
In the Tree/Branch system, while a folder may have any number of sub-folders, it can only ever have one folder above it, called the Parent Folder, so the bent up-arrow only ever has one place to send you.
Repeatedly clicking the bent up-arrow will get you to the Desktop, even if it takes 10 or more clicks.

Going back further into computing history, 2 horizontal dots mean the same as the bent up-arrow symbol and you occasionally still come across this symbol today.

The dual-pane Explorer window you get when you run Windows Explorer are more flexible than the single pane window because the left-hand pane makes it easy to move up and down the Storage Tree.
This is why Windows Explorer should be the first program you run to perform all but the simplest file operations.
Outside of this program, all other Explorer windows are single-pane where the bent up-arrow is so useful.


No Bent Up-Arrow in Vista

Yes, Microsoft have decided that there's no place anymore for the bent up-arrow in Windows Vista. Just when you've learnt something useful it's thrown in the bin!
The bent up-arrow has been replaced by a breadcrumb trail.

In the story of Hansel and Gretel by the Brothers Grimm, after overhearing their parents' plan to lead them into a forest in order to abandon them, the children successfully find their way back home by following a trail of pebbles they dropped on the way out.
On the second attempt to lose them in the forest, they only manage to find breadcrumbs to drop, and when they come to follow this trail home, they find that the breadcrumbs have been eaten by the animal of the forest.

In recent years a new navigational aid has appeared on websites: a horizontal line near the top of the page showing where you are in the website and the path back to the Home page.
This is called Breadcrumbs.

In Vista, Explorer windows show the trail back up the Storage Tree hierarchy, and clicking on an entry on this trail gives a drop-down list of all the deeper locations that can be reached from that point.
Breadcrumbs are therefore more powerful than the old bent up-arrow as you can jump up and down to more locations in one click.
I miss the bent up-arrow in Vista and would have liked it to be displayed as well as the Breadcrumb Trail.
Breadcrumbs require more brain CPU cycles to work out what you want to do, especially if you only want to jump up 1 or 2 levels.


Microsoft Promotes the Desktop to Become the New Super-Root Folder

The first PC, released by IBM in 1981, had a single floppy drive (actually an optional extra) and so the top level folder of a floppy disc was obviously the Root of the Storage Tree - nothing complicated here.
In fact there was only the Root, as sub-folders only became available after an update to PC-DOS. Later, some PCs had 2 floppy drives and a 10mb hard drive - (75,000 times smaller than today's largest hard drive) - and, much later, a CD ROM drive and a mapped network drive.

The top level folder of each storage device was called the Root and, like separate trees in a forest, all seemingly unconnected and you "jumped" from one device to the other as necessary.

In Windows 95, Microsoft made a new, higher level super-root, root-of-roots, that established a single top-level point, uniting the storage devices, from which all possible file storage locations, accessible from a PC, branched off from - no more jumping.
They called this super-root the Desktop and it's still with us today.

The problem with adding locations in the Storage Tree, higher that the root folders of a storage device, is that they aren't real locations in which real files can be stored.

In fact Windows 95 introduced 4 new Storage Tree locations:-
The Desktop
My Computer - now called just Computer,
Network Neighborhood - renamed My Network Paces and again to just Network,
My Documents - also sometimes called, for instance, "John's Documents".
My Documents doesn't count as is not an integral part of the Grand Unified Storage Tree. I'll tell you why a bit later.

My Computer is  a location below the Desktop and contains storage devices that are present locally on the computer you are using; the ones you can see with the system box case open.
Network is also a location below the Desktop which contains storage devices "somewhere else" - across the room or across the world.

Both Computer and Network are concepts, virtual locations which cannot, themselves, contain files or folders and are useful as "signposts" to the files you want to find.
The Desktop is much cleverer than that as it can store real files and folders.
The Desktop is 3 things
1 - The graphical menu you see immediately after logging on which contains icons allowing you to quickly access the programs and files you use most often.
What's clever is that every user of a PC has their own Desktop, customised for their needs, including such irrelevancies as Wallpaper, Screen-saver, mouse pointer and system sounds.
In Windows 3.1, the predecessor of Windows 95, this was all that the Desktop was.
2 - The conceptual super-root of a computer's Storage Tree.
This is a very different role to being a graphical menu and is something that users' often don't realise.

Microsoft decided that a single unified storage tree was better than having many storage device trees in a forest - Linux has always worked with a single root - and, rather than invent a new super-root object, they chose to make the Desktop do double-duty.

^ Top of Page ^


Feedback Form

Type your message below:-

Anonymous feedback is fine. If you'd like a reply then we'll need an email address to send it to.
We'll only use it to reply to your feedback, nothing else!

 

Email Address:
(optional)

We sometimes like to publish visitor feedback, but only if you give us your permission:-

Choose a Screen Name for Published Comments:

 

If you prefer, you can, instead, email your comments to support@arrowmail.co.uk

^ Top of Page ^


 

Arrowmail is owned and operated by Rhebus Limited, a UK-registered company, number 4079706.
We welcome any comments about this website, good or bad. Send them to webmaster@arrowmail.co.uk