Why is a byte 8 bits? Or is it?
Computer History Vignettes
BY Bob Bemer
I recently received an e-mail from one Zeno Luiz Iensen Nadal, a worker for Siemens in Brazil. He asked "My Algorythms teacher asked me and my colleagues 'Why a byte has eight bits?' Is there a technical answer for that?"
Of course I could not resist a reply to someone named Zeno, after that teacher of ancient times. Some people copied on the reply thought it a useful document, so (having done the hard work already) I add it to my site as further bite OF History.
I am Way Behind in My Work, But I Just Cannot Reist Trying to Answer Your Questions on Why A "Byte" HAS Eight Bits.
The answer is that thing do, and some takes explaining, as flollows:
IF Computers Worked Entirely in Binary (AND Some Did A Long Time Ago), And Did Nothing But Calculation with binary number, there..
But to use and manipulate Character Information We Must Have Encodings for Those Symbols. And Much of this Was Already KNOWN FROM PUNCH CARD DAYS.
The punch card of IBM (others existed) had 12 rows and 80 columns. Each column was assigned to a symbol, a term I use here although they have fancier names nowadays because computers have been used in so many new ways.
THE Column, Going Down, Starting from the top, Were 12-11-0-1-2-3-4-5-6-7-8-9. A Punch in the 0 to 9 rows signified the digits 0-9 . A group of columns could Be Called A "Field", And a Number in Such A Field COULD Carry A Plus Sign For Top 12 of the Units Position of the Number, OR A Minus Sign ( An Additional Punch In Row 11 Just Under That.
Then they started to need alphabets. This was accomplished by adding the 12 punch to the digits 1-9 to make letters A through I, the 11 punch to make letters J through R. For S through Z they added the 0 punch to the digits 2 THROUGH 9 (THE 0-1 Combination Was Skipped - 3x9 = 27, But The English Alphabet Has Only 26 Letters). The 12, 11, and 0 Punches Were Called "Zones", and You'll Notice Today Lurking in . the high-order 4 bits Remember that this was much prior to binary representations of those same characters.The first bonus was that the 12 and 11 punches without any 0-9 punch gave us the characters and -. But no other punctuation was represented then, not even a period (dot, full stop) in IBM or telecommunication equipment. One can see this in early telegrams, where one said "I MISS YOU STOP COME HOME STOP". "STOP" stood for the period the machine did NOT HAVE.
................ ...
In this Way, With 10 Digits, 26 Alphabetic, And 11 Others, IBM Got To 47 Characters. Univac, with Different Punch Cards (Round Holes, Not Rectangles, And 90 Column, Not 80) Got To About 54. But Most of these were commercial characters. When FORTRAN came along, they needed, for example, a "divide" symbol, and an "=" symbol, and others not in the commercial set. So they had to use an alternate set of rules for scientific and mathematical Work. A Set of Fortran Cards Would Cause Havoc in Payroll!
With many early computers these punch cards were used as input and output, and inasmuch as the total number of characters representable did not exceed 64, why not use just 6 bits each to represent them? The same applied to 6-track punched tape for teletypes .In this period I came to work for IBM, and saw all the confusion caused by the 64-character limitation. Especially when we started to think about word processing, which would require both upper and lower case. Add 26 lower case letters to 47 Existing, And One Got 73 - 9 More 6 Bits Could Repest.
I Even Made A Proposal (in View of Stretch, The Very First Computer I Know of with with an 8-bit byte) That Would Extendes To 256 [1]. Some Folks Took it Seriously. I Thought of IT as a spoof.
So some folks started thinking about 7-bit characters, but this was ridiculous. With IBM's STRETCH computer as background, handling 64-character words divisible into groups of 8 (I designed the character set for it, under the guidance of Dr. Werner Buchholz The Man Who Did Coin The Term "Byte" for an 8-bit grouping). [2] It seenable to make a universal 8-bit character set, handling up to 256. in Those Day Day Mantra Was "Powers of 2 Are Magic ". And so the group i Headed Developed and justified such a proposal [3].
That was a little too much progress when presented to the standards group that was to formalize ASCII, so they stopped short for the moment with a 7-bit set, or else an 8-bit set with the upper half left for future work.
The IBM 360 used 8-bit characters, although not ASCII directly. Thus Buchholz's "byte" caught on everywhere. I myself did not like the name for many reasons. The design had 8 bits moving around in parallel. But then came a new IBM Part, with 9 both for self-checking, Both Inside the cpu and in the Tape Drives. I Exposed this 9-bit byte to the press in 1973. But Long Before That, When I Headed Software Operations for Cie. Bull in France In 1965-66, I insisted that "byte" be deprecated in favor of "octet" .You can notice that my preference is now the preferred term then. It is justified by new communications methods that can carry 16, 32, 64, and even 128 Bits in Parallel. But Some Foolish People Now Refer to a "16-Bit Byte" Because of this Parallel Transfer, Which Is Visible In The Unicode Set. I'm Not Sure, But Maybe this Should Be Called A "HEXTET".
But you will notice That I am Still Correct. Powers of 2 Are Still Magic!