I absolutely love this example of old systems interfering with new systems, and with the rewriting of old systems.
My old man started his tech work on hot rods, then mechanical typewriters and calculators, eventually continuing into mainframe electronics, and he followed nearly all the transitions up to today's AI.
The number of times I’ve scratched my head at a problem and he had a clear understanding of where the logic broke… based on a historical decision that could not physically be undone.
If you try profiling almost any program that does linear algebra (something that uses Numpy, for instance), you will see a lot of calls and CPU time in functions with names like DGETRF or SGESVX. These obscure names stand for things like Single-precision GEneral matrix SolVe, eXpert driver; i.e., solve a linear system of equations with a full, dense matrix. Why are they so difficult to parse? Couldn't they come up with a friendlier name?
They come from Lapack, the standard linear algebra foundation library, which is written in Fortran 77. That library was first written in 1992, when the Fortran 90 standard was still new and not supported everywhere, so they stuck with the earlier version. Lapack has become the standard library for dense non-parallel linear algebra; it is still maintained and updated, but the basic math algorithms haven't changed much, so there was no need to replace it entirely. Today there are also processor-specific libraries like MKL or Apple Accelerate, but they still all follow the same Lapack API.
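For instance, here's a minimal sketch (assuming NumPy and SciPy are installed) of reaching the same routines through SciPy's low-level wrappers; the routine names follow Lapack's precision + matrix-type + operation convention:

    # numpy.linalg.solve dispatches to the *GESV driver, which in turn calls
    # *GETRF (LU factorization) and *GETRS (solve with the triangular factors).
    import numpy as np
    from scipy.linalg import lapack

    a = np.random.rand(500, 500)
    b = np.random.rand(500)

    x = np.linalg.solve(a, b)

    # The same routines, called directly through SciPy's wrappers:
    lu, piv, info = lapack.dgetrf(a)       # D = double, GE = general, TRF = factorize
    x2, info = lapack.dgetrs(lu, piv, b)   # TRS = solve using the LU factors
    print(np.allclose(x, x2))              # True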
When Fortran 77 was standardized, they decided to keep function names at most 6 characters long, to "ensure portability"; i.e., they wanted to support certain compilers and architectures that were already considered old in 1977.
TL;DR: if you can't easily read those flame graphs today, it's because of backward compatibility with certain mainframes that probably date back to the late 1960s.
In particular, 6-letter function names may have been convenient on mainframes that used 6-bit alphanumerics in 36-bit words, the 36 bits having been chosen for backward compatibility with 10-decimal-digit electromechanical calculators.
EDIT: I had thought 10 digits of precision were required for certain calculations, but the WP article points out that they may have just corresponded to the operators having had 10 digits on 2 hands, in which case we're being backwards compatible with Hox genes (specifically Hoxd), and tetrapod pentadactyly is backwards compatible going back hundreds of millions of years.
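To make the word-size arithmetic concrete: a 36-bit word holds exactly six 6-bit character codes, i.e., one six-character identifier per machine word. A toy sketch (the encoding below is made up for illustration, not real BCD or Fieldata):

    def pack6(name):
        # Pack up to six characters into one 36-bit "word", 6 bits per character.
        assert len(name) <= 6
        word = 0
        for ch in name.ljust(6).upper():
            code = (ord(ch) - 32) & 0x3F   # squeeze into 6 bits (toy encoding)
            word = (word << 6) | code
        return word

    w = pack6("DGETRF")
    assert w < 2**36
    print(f"{w:012o}")   # 36 bits come out as exactly 12 octal digits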
Had more to do with punch cards, Flexowriter tapes, and octal, which predate large word sizes or even mainframes. Note the following from the MIDAS macro assembler [0]:
Fortran predates this and was a different lineage than IBM, but note how six-char symbols were a request:
> The MACRO language had been used on the TX-0 for some three years previous to the writing of MIDAS. Hence, MIDAS incorporates most of the features which have been requested by users of MACRO, such as more flexible macro instructions, six character symbols and relocation.
Note that the port of B to the PDP-11, which used ASCII rather than the earlier FIODEC/Flexowriter 6-bit paper tape codes, is why C case statements fall through: fall-through was used, for example, to allow lower-case commands in ed.
Flexowriters are from the 1940s, IIRC, and the TX-0 through the early PDPs were octal, so it makes sense to grow in multiples of the 3-bit lines of paper tape.
Also note you can count to 12 on one hand and to 60 with the other. That is why the ancient Sumerians used it. Base 10 was added to the Roman abacus, but they still kept the uncia (12) for some functions.
IIRC that wasn't dropped until the Renaissance, when they read Archimedes' attempt to calculate the number of grains of sand needed to fill the universe; he used decimal and they asserted it was superior.
At my first job circa 1990, our codebase was constrained to 6-character function names in the core libraries, which had to run on many platforms including mainframes. If I recall correctly, you could have longer names, but only the first 6 characters were significant to the linker.
Never thought about why that might be other than "yeah, memory is expensive".
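A toy illustration of what "only the first 6 characters are significant" means for longer names (not a real linker, just the truncation effect):

    # Two distinct function names that collide once truncated to 6 characters.
    symbols = {}
    for name in ["matmul_fast", "matmul_slow"]:
        symbols[name[:6]] = name
    print(symbols)   # {'matmul': 'matmul_slow'} -- the second silently wins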
That is correct, I did not mention Linpack. It had different function names than Lapack, though (the naming scheme was similar, and still constrained to 6 letters); for instance, DGETRF was named DGEFA in Linpack. [1]
Yes. Lapack was the successor to Linpack, and I seem to recall some of the Linpack routines going back much further than the eighties. MATLAB (which existed before its commercial release in 1984) was built on Linpack.
Cue obligatory reference to the programmer archaeologists in Vernor Vinge's novel A Deepness in the Sky. Their job, on starships, is to safely bodge the multiple strata of software that have accreted since Mankind left Earth, centuries before.
Have seen this time and time again during my career.
Most of the time, it's something you could never conceivably figure out without having been there at the time. But after 10 seconds on the phone or a brief email from someone who was, it makes complete sense.
IBM is the undisputed king of backward compatibility. There is code running on mainframes right now that is going on 50 years old. Microsoft is a close #2 with Windows.
I'd probably consider using IBM if it wasn't so goddamn weird and expensive. I suppose all that backward compatibility does have its downsides. Windows feels a bit weird in some places too, but at the same time it didn't start out life as a typewriter.
Strangely, this made me think about the recurrent laryngeal nerve in giraffes.
The nerve takes a 15-foot detour down the long neck and loops under the aorta near the heart before it travels back up because evolution needed to stay backwards compatible with previous iterations of protogiraffes as environmental selection pressure lengthened the neck.
I love this fact. If you're a fish with no neck, the route it takes is the most direct and obvious. But as evolution gradually lengthened necks the route remained the same!
For those who don't get it: it's referring to the ink-soaked ribbon that was used to print characters on a piece of paper, much like a typewriter's. That technology precedes digital consoles, and it's also why most programming languages refer to outputting a string to stdout as "print".
It's almost the same reason Windows still uses CR LF characters for new lines.
Not one character, but two: Carriage Return and Line Feed. Literally the action of moving the carriage back to the beginning of the line, and then the action of making the sheet of paper go "up" by one line.
That's why those characters exist, but not why Windows uses both: Unix already used LF only, and the Apple II (and Mac, for a while) used only CR. The choice to use both was, as far as I know, Gary Kildall's, in CP/M, and various DOSes including MS-DOS inherited that decision without much examination.
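You can still watch the two characters doing different jobs in any modern terminal; a quick sketch:

    # "\r" returns the cursor to column 0 without advancing a line (so you can
    # overwrite the line, as progress bars do); "\n" advances to the next line.
    import sys, time

    for i in range(5):
        sys.stdout.write(f"\rprogress: {i + 1}/5")
        sys.stdout.flush()
        time.sleep(0.2)
    sys.stdout.write("\n")   # finish with a line feed

    print(b"one\r\ntwo\n".decode().splitlines())   # ['one', 'two'] either way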
It was a typewriter ribbon, and the type of terminal it was designed to be used with was a typewriter with communications circuitry, called a "teletypewriter". This is why the controlling terminal of Unix CLI/TUI processes is called a tty or pty (pseudo-tty).
Score a GE TermiNet 30 or similar teletypewriting terminal off eBay, hook it to your PC via USB-to-RS232, and Bob's your uncle. You can even run a getty on it, log in, and run shell commands: https://www.youtube.com/watch?v=-Ul-f3hPJQM
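If you'd rather script it than run a full getty, a rough sketch with pyserial (the device path, baud rate, and framing here are assumptions; they depend on the particular terminal and adapter):

    import serial  # pip install pyserial

    # A TermiNet 30 prints at 30 characters per second, roughly 300 baud.
    tty = serial.Serial("/dev/ttyUSB0", baudrate=300,
                        bytesize=serial.SEVENBITS,
                        parity=serial.PARITY_EVEN,
                        stopbits=serial.STOPBITS_ONE)
    # CR LF matters here: the carriage really has to return before the next line prints.
    tty.write(b"HELLO, WORLD\r\n")
    tty.close()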