A spellchecker used to be a major feat of software engineering (2008)
148 comments
·August 9, 2025keiferski
AnotherGoodName
Yes but it’s much broader. Just in general the lack of Steve Jobs noticing these glaring issues and coming down hard to solve them is pretty clear.
I remember when macbooks briefly came out with a ridiculously bright standby led that required Black electrical tape over if you wanted to sleep with it in the house. Shortly after no more status leds on any MacBook (thank you!).
Nowadays i find non stop little annoyances with threads from others on the same issues on Apple devices. From.the.overly.prominent.full.stop when searching textually in the url bar to the crappy spell check and crappy spam filtering. As much as Jobs apparently came across as an asshole there’s a need for someone at the top to say ‘WTF is this, fix it or get fired!’.
gitpusher
I worked at Apple and heard a lot of Steve stories. He really did personally approve everything. He would be sitting in a room, and team leads would all line up to give their quick 2-minute update. So it's the MacBook Air guy's turn. He comes in and places his prototype down in front of Steve. Steve opens the lid. Two seconds later he picks up the laptop and heaves it so hard it skipped across the table like a stone on water: "I said fxxking INSTANT ON!!" The poor guy collected his prototype and exited the room. Later the MacBook Air launched... it fxxking turned on the moment you open the lid
mathiaspoint
Good product development really does seem to require some sort of leader who demands quality and smacks people when they don't deliver. Linux is nice because of Torvalds for example.
qmr
Fuck fuck fucking fuck.
It's just a word.
csb6
Sounds like a petulant child. Wholly unnecessary to get his point across.
FirmwareBurner
>Two seconds later he picks up the laptop and heaves it so hard it skipped across the table like a stone on water: "I said fxxking INSTANT ON!!"
When did the OG MacBook Air have instant on at launch in 2008?
IIRC the M1 brough Instant on and Jobs wasn't around anymore.
IshKebab
I've also found a lot of this stuff is due to naysayers telling people that things can't be fixed (because really they don't want to bother). You need a strong leader to say "no it can and we will".
abullinan
It takes a village. Also to be successful in tech it takes an asshole. No way around it. At some point all successful companies share an overly aggressive visionary. The entire company doesn’t need to be toxic, but the apex does. If you don’t like it, don’t climb the ladder.
ww520
Or the incentive aligned.
paradox460
Not just that, but the strong leader needs to ensure that it can be fixed.
Yelling at a rank-and-file to unfuck some random system, then not giving them any time, resources, or tools to fix it is just being a dictatorial dickhead.
mrweasel
It's hard to pinpoint exactly what it is, but yes, there seems to be an increasing number of small issue with Apple devices. They aren't major stuff simply not work, but yes, spam filter being pretty terrible, text overlapping on non flagship phones (e.g. the iPhone SE). All sorts of minor annoyances.
AnotherGoodName
Yep I've worked at other big tech where they had periodic "executive bug filing" where executives would flag minor things that are annoying. These minor issues would then have higher than normal priority purely by virtue of being flagged by an exec. I have a likely controversial opinion that this executive bug filing actually led to better outcomes.
It did break prioritization in the opinion of the ground level teams and their goals but I argue it's not bad to at least periodically do this since grating against the current org structure prioritization and goals is not a bad thing to do on occasion.
Chances are they'll find there's no team that considers themselves the owners of spell check or spam filtering and the goals the keyboard team are going for is likely some silly thing like "number of sentences with correct punctuation" leading to the current ridiculous outcomes where the period in the URL is way too prominent, especially considering we don't even type full URLS into the search bar that often these days.
Dear Apple leads: if you're reading this do a short initiative where execs aim to file an annoyance a day. It's not hard to find such. There will be some complaints at the ground level that these executive annoyances get too much priority but part of that will be because you're questioning lower level org priorities (a healthy thing to do!), not because the issues don't matter. The end result will bring Apple a bit more in line with the quality we saw during the Jobs period since this is exactly the kind of shake up he did on occasion.
zimpenfish
> From.the.overly.prominent.full.stop when searching textually in the url bar
One of the most aggravating things in iOS. Trips me up almost every day (and it's been there for what? 10 years now?)
AnotherGoodName
Wait until you realize that the icon of the period and spacebar don't at all line up to the touch area due to touch gravitation. You can tap slightly more on the spacebar side and still end up with a period. https://www.reddit.com/r/iphone/comments/1ekszul/comment/lgn...
So if you suffer from this it's not even your fault. You're literally hitting the spacebar but some incentive at Apple in their org structure has led to the period literally having waaaay too much weighting and the lack of exec oversight at Apple in the post Jobs days is leading to us all.typing.periods.whenever.we.just.wanted.to.search.
egypturnash
rightnnextntonthenspuriousnninsteadnofnanspace
elzbardico
That light was really helpful to do the occasional late night visit to the bathroom in a home where I lived where there were no light controls at the bedside.
eru
> Is there a reason why Apple's iPhone spellcheck is often really poor, significantly worse than both LLMs and just...human eyes?
It's somewhat funny that human performance is seen as a baseline here, and not the pinnacle of achievement to aim for.
(I agree with you. I just find it entertaining.)
jmkni
There's nothing more frustrating than when you type the word you want to type, it changes it to a different word, you delete it and type the word you wanted to type again and then rinse/repeat 3 to 4 times before you have the word you actually wanted.
And if you're not paying attention, your message ends up looking like you're having a stroke.
mikestew
It used to be that if you typed, deleted the correction, and retyped, that spelling would now be the preferred and you wouldn’t have to play that game anymore. Apple broke that years ago.
bcrl
I'd be happier if the suggested word didn't move between the time I saw it appear and the time my finger touched the screen.
nwellinghoff
The spell check is truly bad. It boggles the mind how this is even possible given how solved the problem is everywhere else. Also the period being to the right of the spacebar such that it gets hit instead of space. So annoying!
moi2388
I have the same experience. Some things I’ve noticed:
- they really don’t want you saying bad words of any kind.
- they do not look at context at all
- they focus too much on the first letter of the word for suggestions
bapak
> they really don’t want you saying bad words of any kind.
Not true anymore, I just typed fuck in this comment without having to fight it. They made a change I think last year and they even announced it.
> they do not look at context at all
Also not true. It's true that they're not perfect at it, but replacement after you typed 2 more words happen specifically because it can tell better what you want to say. Sometimes works against you because language is highly personal.
nicce
Bit off-topic - macOS has excellent built-in dictionary. Just select the word in any app, press Ctrl+Command+D and it opens it. It even guesses most incorrect words correctly. Also translation available if it exist for current keyboard locales.
E.g.
> No entries for "typoglycemia", did you mean "hypoglycemia"?
qmr
There's also a gesture. Triple three finger tap or some such like.
bee_rider
These user activated dictionaries tend to be excellent (even in vim, a pretty barebones system, I tend to get fantastic guesses from the machine).
Actually, come to think of it, the problem must be a bit easier than on smartphones, right? Real keyboard input is very precise. Smartphone keyboards already guess what word you were trying to spell, so they are influencing the typos in the direction of likely words… cannibalizing the very guess list that the dictionary uses!
paradox460
Alfred ties into it nicely too, you can type `spell someword` and the completions below have the various spellings of words, fuzzy matched. Select one and the word goes onto your clipboard
fluidcruft
I feel the same way about Android's. It just seems like spell check used to be so much better then years ago. But I'm not sure whether it's comparing mobile with desktop expectations. It really seems extremely dumb on Android.
yndoendo
It is 2025 and the best spell checker is a search engine. Numerous time an application will not provide the correct word. Only solution is to try the word in a search engine and try using in a sentence if that fails.
In my opinion, this is where ML/AL local model, no internet required, would be the most beneficial today.
Even had to use a search engine with, "thoughts and opi" because I forgot how to spell opinion before posting this. In application spell checker was 100% useless with assisting me.
athrowaway3z
I've had a related idea for a while now.
Instead of how LLMs operate by taking the current text and taking the most likely next token, you take your full text and use an LLM to find the likeliness/rank of each token. I'd imagine this creates a heatmap that shows which parts are the most 'surprising'.
You wouldn't catch all misspelling, but it could be very useful information to find what flows and what doesn't - or perhaps explicitly go looking for something out of the norm to capture attention.
paol
I would like this too. This approach would also fix the most common failure mode of spelling checkers: typos that are accidentally valid words.
I constantly type "form" instead of "from" for example and spelling checkers don't help at all. Even a simple LLM could easily notice out of place words like that. And LLMs also could easily go further and do grammar and style checking.
NitpickLawyer
I've seen this in a UI. They went a step further and you could select a word (well token but anyway) and "regenerate" from that point by selecting another word from the token distribution. Pretty neat. Had the heatmaps that you mentioned, based on probabilities returned by the LLM.
This should also be pretty cheap (just one pass through the LLM).
null
anuramat
That's how BERT is trained, masked language modeling
simianwords
in fact it can work at the language level completely with a prompt like "mark parts of this paragraph that don't flow well".
golem14
Some new way of Stenography. That everyone can use. Would make taking notes so much easier.
jimt1234
Can confirm. The first time I saw an automatic spellchecker was probably with WordStar around 1989, and it blew me away. How can the computer know all the words? That's insane! Sounds lame, but it's true. It was a different world.
torium
Since you're old enough, here's a question for you. Do you remember if at the time the first spellcheckers were invented, people were negative on spellcheckers, because that would mean that soon people would stop learning how to spell and just general dumbing down?
It seems that anything that helps people gets this reaction these days. On the one hand, the argument 100% resonates with me. On the other hand, spelling isn't really the end, is it? It's just a means to an end, so what's wrong with making the mean easier? Did people worry that you'd stop knowing how to plant potatoes when trading was invented? EDIT: The example doesn't make sense because agriculture is newer than trading, but you got the idea.
aidenn0
I had an English teacher who allowed submissions only on typewriter or hand-written with spellcheckers mentioned as the reason.
mikaelmello
“The careless, the inept, the spelling disabled will be able to survive in a world of words by relying on computers to conceal their own weaknesses.”
https://www.washingtonpost.com/archive/opinions/1985/06/02/t...
runako
Not as much with spellcheckers because even when they started to get popular, it was apparent that many people cannot spell English. So it was very natural.
People pushed back on the grammar checks when they landed in Word.
Before that, people pushed back on calculators in secondary schools. This was a huge point of contention all classes except trigonometry, and calculators were definitely not allowed in the SAT/ACT.
kergonath
> People pushed back on the grammar checks when they landed in Word.
Word’s grammar checker has improved quite a lot. But I absolutely hate the style checker and its useless advice. Yes, I know how the passive voice works and yes, it is appropriate in this sentence. Also, it’s not really a problem in English but Word still can’t do spaces properly so it wants to put normal spaces everywhere and it’s fucking ugly. I wish it would spend as much time fixing inappropriate breaking spaces (in English as well).
Gibbon1
What I think is one should question if something is the point of the exercise.
I'd argue that where writing spelling is something completely arbitrary and thus of no fundamental importance. Arithmetic is the same way, lots of algorithms to do that and they are all valid. So calculators and spell checkers are fine. And you should use them.
The same is not true for grammar. Getting AI to write an essay for you.
bapak
I think this happens every time something gets automated away, and in a way it's true. I'm sure a lot more accountants knew 123x27 by heart before than they do now. The problem is LLMs take out the whole process of thinking, and that is going to be a problem: you generally need to think even when you're not in front of a screen.
paradox460
I remember hearing this as late as the early 00s. I'd buy electronic dictionaries and spell checkers at yard sales and things like that, and use them in class. Multiple teachers were disapproving of it, despite it basically just being a paper book dictionary in a small, TI-92 shaped device. 10 year old me never saw how flipping through some obnoxiously heavy book in the back of the classroom was better than just punching in a few letters, hitting the "show definition", and ensuring I was spelling and using "curmudgeonly" properly.
Same went for using MacWord vs AppleWorks. MacWord had a built in dictionary, AppleWorks didn't.
mrweasel
> because that would mean that soon people would stop learning how to spell and just general dumbing down?
I'd argue that negative people where correct. People can't spell anymore, not even with a spellchecker. Maybe they never could? I'm not against spellcheckers, I think they are amazing, but they haven't helped much.
null
KineticLensman
I don't remember any particular negative reaction to spell-checkers like the 'calculator panic'.
Perhaps partly because most schoolkids then wouldn't have been using word processors as their main writing tool at school and people using them in a corporate environment were pleased not to make embarrassing errors in their emails.
taftster
It was the opposite experience for me. Before spellcheck was commonly part of the web browser, I would go back and reread some very early emails and/or usenet posts from myself. And realize how atrocious my spelling was.
I actually consider spellcheck to have improved my spelling dramatically over the years. The little red squiggles under words have helped me to recognize my misspellings, especially the words that are hard for me to get right consistently.
NitpickLawyer
I had the same thing when the Encarta CDs started to include pronunciation tests. You'd get a word, speak it in the microphone, and get a "score" on how well you pronounced that word. Knowing what I know now, it was probably pretty inaccurate and hand wavy, but in the early 90s that was an absolutely amazing experience for an ESL person.
gnabgib
(2008) Popular in:
2023 (314 points, 180 comments) https://news.ycombinator.com/item?id=34971924
2020 (363 points, 143 comments) https://news.ycombinator.com/item?id=25296900
2012 (94+156 points, 70+61 comments) https://news.ycombinator.com/item?id=4640658 https://news.ycombinator.com/item?id=3466388
dang
Thanks! Macroexpanded:
A spellchecker used to be a major feat of software engineering (2008) - https://news.ycombinator.com/item?id=34971924 - Feb 2023 (180 comments)
A spellchecker used to be a major feat of software engineering (2008) - https://news.ycombinator.com/item?id=25296900 - Dec 2020 (143 comments)
A Spellchecker Used to Be a Major Feat of Software Engineering (2008) - https://news.ycombinator.com/item?id=10789019 - Dec 2015 (29 comments)
A Spellchecker Used to Be a Major Feat of Software Engineering - https://news.ycombinator.com/item?id=4640658 - Oct 2012 (70 comments)
A Spellchecker Used To Be A Major Feat of Software Engineering - https://news.ycombinator.com/item?id=3466388 - Jan 2012 (61 comments)
A Spellchecker Used to Be a Major Feat of Software Engineering - https://news.ycombinator.com/item?id=212221 - June 2008 (22 comments)
efitz
Having a dictionary is a prerequisite but is only a small part of the spell check problem. Plus, plain text word lists are slow to parse in the 80s; better going with a Trie or some other exotic tree structure that is naturally compressed but O(log(n)) instead of O(n) to traverse.
The computer has to figure out whether the word is in the dictionary, but it also has to figure out a suggestion for what to change it to.
And even after just that, we already have a bug- homonym mistakes- homonyms are in the dictionary but they’re misspelled (that was intentional btw).
How misspelled is another problem. We’ve had Levenshtein et al algorithms for a long time, but how different can you get? A really badly misspelled word might not have any good replacement candidates within your edit distance limit.
There are also optimizations like frequently mistyped words (acn-> can), acronyms, etc.
It was never just about size.
hilbert42
"A Spellchecker Used to Be a Major Feat of Software Engineering"
It still is. The spell checker on my Android phone is a PIA. It's too dumb to correct many typos, there's no way of highlighting wrongly used but correct words such a 'fro' and 'for', etc. There's no automatic or user defined substitution such as correcting 'rhe' with 'the' and yet keep the words highlighted until a final revision.
Wordpossessor spellers have no way of tagging certain words that one may or may not wish to use depending on context. A classic example that's caught me out past the draft and found its way into the final document without me noticing it is 'pubic' for 'public'. Why doesn't my speller highlight such words in red and ask whether I actually meant to use this word?
Moreover, spellers are not all of the same level of accuracy, for example Microsoft Word's speller is much better than LibrOffice's much to my annoyance as LibreOffice is my main (preferred) WP.
Nor is there a method of collecting misspelled words or typos and tagging them as spelling errors or typos for the purpose of helping one's spelling or typing. It'd be nice to have a list of my misspelled words together with their correct spelling, that way I could become a better speller. Also, spellers could be integrated with full dictionaries—highlight the word and press F1 for its meaning, etc.
There are no dictionary formats that are both universal and smart, that is that would allow for easy amalgamation between dictionaries and yet could contain user defined words and other user metadata which would be distinguished from the general corpus of words when crossed or amalgamated. For example, a smart dictionary format could contain metadata that would allow a dictionary and thesaurus to coexist in the same word list, similarly so different dictionaries, technical, medical etc.
All up, spellercheckers are still a damn mess. They need urgent attention.
dhosek
The way growth in memory availability changes the scope of problems is really quite astonishing. I cut my teeth writing code for Apple ][ computers with theoretically up to 128K of RAM, but in practice much closer to 40K for most use cases, but it does make me much more conscious of memory and CPU usage than younger devs who never faced these sorts of constraints.
Thinking of the example given about being able to just load the word list into memory, I did something of that ilk when my son’s fifth grade class read a book which had a concept of dollar words: You assign a value to each letter, a=1, b=2, … z=26, add up the value and try to get exactly 100. It was pretty trivial to write a program that read the word list and produced the complete list of dollar words (although I didn’t share that with my son, I did give him access to the word list and challenged him to write the program himself).
At the moment, I’m building up a Spanish rhyming dictionary by using a Spanish word list, reversing the words and sorting the reversed list to find the groups of words that are most likely to rhyme, which was something that 30 years ago would have been a challenge on my desktop computer but now is a brief script that I’m just as likely to manage through perl 1-liners and shell pipes as not.
chilipepperhott
Checking if a word is spelled correctly is easy. It is providing high-quality suggestions that is hard.
bluGill
Even that is hard since there are many words where a mispelling is a valid different word.
DJBunnies
Having written a spellcheck, maybe 20 years ago for school, this is accurate.
Finnucane
As a copyeditor/proofreader, the number of times over the years I've had to fix the low-quality (i.e, wrong) suggestions is quite large. ("he had a small plague on his desk" remains a favorite.)
aidenn0
"He looked in the [car's] grove department" is still my favorite.
cratermoon
I have a spelling checker
It came with my PC
It highlights for my review
Mistakes I cannot sea.
I ran this poem thru it
I'm sure your pleased to no
Its letter perfect in it's weigh
My checker told me sew.
https://www.thoughtco.com/spell-checker-poem-by-mark-eckman-...KineticLensman
One of my favourites was 'rouge elephant'.
DevX101
It still is? Few engineers could build a good spell checker without external libraries, giving a database of valid words.
aidenn0
TFA is just about flagging misspelled words, not making suggestions. That's something I would expect any undergraduate to be able to write.
Aurornis
> Few engineers could build a good spell checker without external libraries, giving a database of valid words.
Writing a spell checker that quickly identifies if a word is in a list of valid words (the problem described in the article) is a trivial problem for anyone who has basic algorithms and data structure knowledge. It's the classic example for using a trie: https://en.wikipedia.org/wiki/Trie
The problem described in the article is doing it within very limited storage space. How do you store your list of 200K words on a system with only 256K of memory? This is the challenging part.
mandeepj
> How do you store your list of 200K words on a system with only 256K of memory?
Your hard disk is almost always larger than your RAM. You only load into memory what's needed at the moment. I hope that gives a hint on how to proceed with the above problem.
aidenn0
OK. You have no hard disk, just a 360K floppy.
dhosek
But you don’t necessarily even have a hard disk. You might only have a 320K floppy. Floppy-only computers were pretty common in the late 80s when I was in undergrad.
sib
The article refers to the need to support machines that did not even have hard disk drives.
Hackbraten
What's a hard disk?
kccqzy
I was actually asked to build a spell checker in an interview. I immediately thought of Peter Norvig's article on spell corrector (https://norvig.com/spell-correct.html) and proceeded to explain. It turned out that the interviewer really wanted a spell checker not spell corrector: the program will only point out words not in the set of known words.
I failed that interview by overengineering.
dehrmann
> I failed that interview by overengineering.
Almost. You needed to clarify what the interviewer was asking and discover requirements. As much as HN likes to hate on coding interviews requiring specific algorithm knowledge, determining requirements is very much part of the job, and engineers have a tendency to build what they want to build, not what the customer wants.
elzbardico
I don't care about people who like to overengineer things, as long as they are humble and have awareness of this tendency. Sometimes you need the extra power and creativity. If they are affable and don't take critics personally, they usually get in with the program easily.
Gibbon1
Marvin the Paranoid Android voice. 'Oh... that. Just do a linear search on a dictionary. Put the small words first. I know it's woefully inefficient but you won't fuck that up'
adrianN
The existence of external libraries is part of the point.
justin66
It takes a PhD to develop that. (ahem)
thimabi
Given that spellcheckers are mostly stable tech, I wonder why Google’s spellchecker in Gmail, or even in Chrome in “enhanced” mode, is so bad.
Even Microsoft Word, being a local app and everything, manages to work better than Google’s cloud-based offerings. That’s surely evidence that progress is far from being linear.
devmor
There have been some very specific issues I and others have noticed that lead me to believe the backend for Google’s cloud based spellchecker has changed from a traditional language model to some more generalized LLM-based system. It’s gotten distinctly more terrible a couple of times in the last few years.
numpad0
It's also a key enabler to CJK typing on computers. CJK scripts never map to keyboards well, so instead of actually typing, approximate representations are typed in and regularized into written forms using similar technologies as spell checkers. It's a neat thing if you speak one of the languages, sort of interesting that a similar tool haven't been integrated into English keyboards.
grishka
Doesn't Chinese input usually work by typing Latin codes for characters? Korean characters represent syllables made up of shapes representing individual sounds, those fit on a keyboard just fine. And I'm not sure about Japanese, there they may use something like spell checkers to map kana to kanji.
Another interesting challenge with CJK languages was just displaying them. You need higher-resolution graphics and a much bigger character ROM to even consider that.
pcrh
This is a history Chinese keyboards before word processors, and even as they were first introduced: https://spectrum.ieee.org/chinese-keyboard
Very interesting! It was certainly a different technological challenge...
Also discussed here: https://news.ycombinator.com/item?id=40537464
paradox460
Romanization systems for Chinese vary, but all have the issue that a single "word" in the romanized system can map to dozens, if not hundreds, of actual "words" in the target language.
Pinyin is sort of the standard for romanization, although other systems exist, as well as inputs that aren't based on romanization (bopomofo).
Take the pinyin `fei`. Just looking at the tones that can be on this word, it can mean at least 4 words (my dictionary app couldn't find any neutral tone words). In reality, its at least dozens, each with different contextual meanings.
numpad0
IIUC there are ambiguity problems in Chinese and Korean, just less than there are for Japanese. Korean input has no end-of-character marks and multi-character entry could be split different ways, Chinese has bunch of homonyms-in-Latin, and Japanese is a huge mess(like always, if I think about it...)
knicholes
I loved what Peter Norvig did with this. It's not just a spell checker, but a spelling corrector. https://norvig.com/spell-correct.html
soperj
"why should they know about something so far __outisde__ their specialty?"
Should have used it on his spell-correct article.
Is there a reason why Apple's iPhone spellcheck is often really poor, significantly worse than both LLMs and just...human eyes?
I often find myself butchering the spelling of a word in a way where the correct answer is obvious to human eyes (probably because of "typoglycemia" [1]) and an AI LLM immediately understands what I meant to say, but Apple's spellcheck has "No Guesses Found."
Does anyone else have this experience?
1. https://www.dictionary.com/e/typoglycemia/