Mathematicians discover new way for spheres to 'kiss'

73 comments

·January 16, 2025

danwills

I'd really love to know what the mathematicians are actually doing when they work this stuff out? Is it all on computers now? Can they somehow visualize 24-dimensional-sphere-packings in their minds? Are they maybe rigorously checking results of a 'test function' that tells them they found a correct/optimal packing? I would love to know more about what the day-to-day work involved in this type of research actually would be!

terminalbraid

> Is it all on computers now?

Most modern math is certainly not "all on computers" and in general not even "mostly on computers". There are definitely proofs for things like testing large spaces exhaustively which are sped up by computers (see the https://en.wikipedia.org/wiki/Four_color_theorem) and definitely for things like visualization (probably one of the oldest uses of computers for math), but usually the real work goes into how math has always been done: identifying patterns and abusing symmetries.

For this one explicitly, if you read through the paper you'll find the statement that the main theorem presented here "does not depend on any computer calculations. However, we have made available files with explicit coordinates for our kissing configurations"

viccis

It really depends though. Even in something like knot theory, that one might consider to be a very "pure" area, there's still a lot of computation involved that can be automated by computers.

iNic

The kind of intuition you gain for higher dimension tends not to be visual. It is more that you learn a bunch of tools and these in turn build intuition. For example high dimensional spheres are "pointy" and most of their volume are near their surface. These ideas can be defined rigorously and are important and useful. For medium dimension there are usually specific facts that you exploit. In my own work stuff like "How often do you expect random walks to intersect" is very important (and dependent on dimension).

david-gpu

> For example high dimensional spheres are "pointy" and most of their volume are near their surface

I had a visceral reaction to this. In what sense can a sphere be considered pointy? Almost by definition, it is the volume that minimizes surface area, in any number of dimensions.

I can see how in higher dimensions e.g. a hypersphere has much lower volume than a hypercube. But that's not because the hypersphere became pointy, it's because the corners of the hypercube are increasingly more voluminous relative to the volume of the hypersphere, right?

iNic

There is a standard thought experiment where you start with a hypercube of side-length 2, centered at the origin. You then place a radius 1 sphere on each vertex of this hypercube. The question then becomes: what is the largest sphere you can place at the origin so that it is "contained" by the other spheres. As it turns out in like dimension 6 or so the radius of the center sphere exceeds 1. It will actually poke out arbitrarily far (while still being restricted by the corner spheres).

btown

https://news.ycombinator.com/item?id=3995615 (both article and comments) describe various ways of looking at this - and there are many implications for machine learning e.g. https://news.ycombinator.com/item?id=3995964 !

null

[deleted]

jochi427

I remember learning about the probability of returning to the origin in a 2D random walk versus a 3D random walk when I took stochastic processes. After we proved with probability 1 you return to the origin in a 2D walk (and with probability 0 you return in 3D) my professor said "that's why you hand a drunk man the keys to a car and not an airplane when he leaves the bar". After checking wikipedia it looks like he riffed off this quote from Shizuo Kakutani: "A drunk man will find his way home, but a drunk bird may get lost forever".

CamperBob2

That's interesting, about the probability being zero in 3D. Is this on an integer lattice? The source that cannot be cited on HN without loss of karma says that the probability of returning to the origin in Z^3 is approximately 0.34.

I don't see how it could possibly be zero, even for reals, unless you're relying on the idea that the probability of any given real emerging from a uniform RNG is zero. That would seem to apply in 2D as well.

scythe

In many cases you are "translating" the higher-dimensional geometry into something that is not geometric or which is much lower dimensional. You don't generally visualize 24 dimensions. You can get a decent intuition for 4 with practice but at some point this breaks down.

For example, the 24-dimensional packing corresponds to the Leech lattice which itself corresponds to the Golay code:

https://en.wikipedia.org/wiki/Leech_lattice

https://en.wikipedia.org/wiki/Binary_Golay_code

jebarker

They definitely don't visualize 24 dimensional spheres. When I did my PhD in pure math I found that gradually I just became comfortable working without any visual or spatial intuition and instead relying on the (algebraic and topological) machinery that had been put in place before me. Terence Tao had a nice essay [1] talking about how the final stage in becoming a professional mathematician is developing the intuition to know what is likely true or not in these very abstract spaces.

I also never used a computer for anything other than latex.

[1] https://terrytao.wordpress.com/career-advice/theres-more-to-...

bell-cot

I suspect that you have plenty of company...but from a journalism PoV, those kind of things are where it gets tricky. Explaining in detail, and at length, is a lot more work than this short article. Then there are the decisions - "just how much detail?", "just how long?", (worse) "how much mathematical background should we assume, in our readers?", and (worst) "how willing will our readers be, to slog through serious mathematics?".

(I'm assuming you've already searched for math bloggers, and similar "labor of love" coverage of the topic.)

davethedevguy

Likewise!

In higher dimensions, are the spheres just a visual metaphor based on the 3-dimensional problem, or are mathematicians really visualising spheres with physical space between them?

Is that even a valid question, or does it just betray my inability to perceive higher dimensions?

This is fascinating and I'm in awe of the people that do this work.

jstanley

> just a visual metaphor

It's not really a metaphor.

An n-sphere is the set of all points that are the same distance away from the same centre, in (n+1)-dimensional space. That generalises perfectly well to any number of dimensions.

In 1 dimension you get 2 points (0-sphere), in 2 dimensions you get a circle (1-sphere), in 3 dimensions you get a sphere (2-sphere), etc.

EDIT: Also, if you slice a plane through a sphere, you get a circle. If you slice a line through a circle, you get 2 points. If you slice a 3d space through a hypersphere in 4d space, do you get a normal sphere? Probably.

close04

> etc.

That's handwaving the answer just as you were getting to the crux of the matter. "Are mathematicians really visualising spheres with physical space between them" in higher dimensions than 3 (or maybe 4)?

From the experience of some of the bigger minds in mathematics I met during my PhD, they don't actually visualize a practical representation of the sphere in this case since that would be untenable especially in much higher dimensions, like 24 (!). They all "visualized" the equations but in ways that gave them much more insight than you or I might imagine just by looking at the text.

zmgsabst

> If you slice a 3d space through a hypersphere in 4d space, do you get a normal sphere? Probably.

Yep — and this will generally be the case, as the equation looks like: x1^2 + x2^2 + … + xn^2 = r^2. If you fix one dimension, you have a hyperplane perpendicular to that axis — and a sphere of one dimension lower in that hyperplane.

For four dimensions, you can sort of visualize that as x^2 + y^2 + z^2 + t^2 = r^2, where xyz are your normal 3D and t is time. From t=-r to t=r, you have it start as a point then spheres of growing size until you hit t=0, then the spheres shrink back to a point.

JJMcJ

Note that the solid set, all points within a certain distance of the center, is called a ball: https://en.wikipedia.org/wiki/Ball_(mathematics)

If the boundary is included, it's a closed ball, otherwise it's an open ball.

So the sphere is the "skin", the ball is the whole thing.

A bit different than common usage.

aleph_minus_one

> In higher dimensions, are the spheres just a visual metaphor based on the 3-dimensional problem, or are mathematicians really visualising spheres with physical space between them?

For such discrete geometry problems, high-dimensional spaces often behave "weirdly" - your geometric intuition from R^3 will often barely help you.

You thus typically rather rely on ideas such as symmetry, or calculations whether "there is still space inbetween that you can fill", or sometimes stochastic/averaging arguments to show the existence of some configuration.

evandrofisico

In my PhD I did study systems in higher dimensions (including fractal dimensions) and it is not a metaphor and no, I did not visualize them, it was more like defining a mathematical representation of the system geometry and working on top of it.

bux93

I have a hard time visualizing even 3 dimension, but 4 dimensions and up, I just think of it as a spreadsheet where each thing has 4 or more columns of data rather than 3. Whether a 4th column is time, spin, color, smell or yet another coordinate.

nejsjsjsbsb

It sort of like the visualizable 3D "kissing spheres" is the story that makes it interesting, captivating and accessible and therefore competitive/social which makes it interesting even more, but basically at higher dims it's a bunch of equations as it is impossible to visualise on human wetware.

You could do kissing starfish but no one cares as there is no lore. A bit like 125m world record doesn't matter. 100m is the thing.

This is not a knock ... it is interesting how social / tradition based maths is.

Another example is Fermat's Last Theorem. It had legendary status.

tomrod

A circle from a flat 2d manifold can be from a 3d sphere, cylinder, or other cross section.

Our mental models don't extend well beyond 3, possibly 4, dimensions, hence _all_ of our intuition starts to be doubtful after 3 dimensions.

crazygringo

It's strange the article doesn't even mention just trying to simulate the problem computationally.

Surely it's not too difficult to repeatedly place spheres around a central sphere in 17 dimensions, maximizing how many kiss for each new sphere added, until you get a number for how many fit? And add some randomness to the choices to get a range of answers Monte Carlo-style, to then get some idea of the lower bound? [Edit: I meant upper bound, whoops.]

Obviously ideally you want to discover a mathematically regular approach if you can. But surely computation must also play a role here in narrowing down reasonable bounds for the problem?

And computation will of course be essential if the answer turns out to be chaotic for certain numbers of dimensions, if the optimal solution is just a jumble without any kind of symmetry at all.

quuxplusone

Maybe the author thought it was so obvious that you could get some lower bounds that way, that it didn't seem worth mentioning! :) Wikipedia has a list, where I presume the lower bounds are mostly demonstrated constructively. Upper bounds must be demonstrated non-constructively, so I presume computers don't really help there.

https://en.wikipedia.org/wiki/Kissing_number#Some_known_boun...

Even in dimension 5, the kissing number is apparently known only as "42 plus or minus 2."

awanderingmind

Here is an example of that sort of thing, using gradient descent as a starting point: https://arxiv.org/abs/math/0611451. It is technically about spherical codes rather than the kissing problem specifically, but they are closely related: https://en.wikipedia.org/wiki/Spherical_code

null

[deleted]

gosub100

Sort of a tangent, but here's a 20m video explaining how to invert a sphere without tearing it:

https://youtu.be/wO61D9x6lNY?si=ecBgnOemKAbYZCrP

nejsjsjsbsb

The interesting ta for me:

> Had she been one of his graduate students, he would have tried harder to convince her to work on something else. “If they work on something hopeless, it’ll be bad for their career,” he said.

anarchonurzox

A small anecdote: my dad is a mathematician. For a significant portion of his postdoc/early career (in the 80's/90's) he worked on proving a particular conjecture. Eventually he abandoned it and went to be much more successful in other areas.

A few years ago someone found a counterexample. He was quite depressed for a few weeks at the thought of how much of his strongest research years had been devoted to something impossible.

Choosing a "good first problem" in math is quite difficult. It needs to be "novel," somewhat accessible, and possible to solve (which is an unknown when you're starting out)!

nejsjsjsbsb

Thanks that is a good anecdote. Did he get over it and how?

To me such a career is useful for (a) the greater good: you can't make discoveries without dead ends and (b) the maths created along the way! Or if not shares then the skills developed.

burjui

At least, we can assume that after a few weeks, he became not quite that depressed.

senderista

At least he didn't "prove" a theorem that turned out to be false!

matsemann

> In two dimensions, the answer is clearly six: Put a penny on a table, and you’ll find that when you arrange another six pennies around it, they fit snugly into a daisylike pattern.

Is there an intuitive reason for why 6 fits so perfectly? Like, it could be a small gap somewhere, like in 3d when it's 12, but it isn't. Something to do with tessellation and hexagons, perhaps?

> They look for ways to arrange spheres as symmetrically as possible. But there’s still a possibility that the best arrangements might look a lot weirder.

Like square packing for 11 looks just crazy (not same problem, but similar): https://en.wikipedia.org/wiki/Square_packing

JKCalhoun

Three pennies form an equilateral triangle with (of course) 60 degree angles.

Six of those equilateral triangles will perfectly add to 360 degrees. Intuitive enough? (I'm being a little hand-wavey by skipping over the part where each penny triangle shares two pennies with a neighbor — why the answer is not 18 for example.)

For my mind though, the intuitiveness ends in dimension 2 though. ;-)

jansan

It would be fun to make that square packing for 11 from wood and give it to puzzle enthusiasts with this task: Rearrange the squares so you can add an additional 12th square. And then watch them struggle putting even those 11 squares back in.

Jun8

If you are wondering what sort of idiot would dispute a math problem with Newton, it was not a dispute at all; it's not even clear that they had a discussion at all: https://hsm.stackexchange.com/questions/5148/how-did-newton-...

dxuh

> “There may be structures without any symmetry at all,” said Gabriele Nebe (opens a new tab) of RWTH Aachen University in Germany. “And no good way to find them.”

She taught Lineare Algebra II when I took it! It was one of the toughest lectures I took during university. I remember looking to the person next to me and one of us asked "do you understand anything?" and the other said "no! I haven't understood anything for like 20 minutes" and we burst out laughing and couldn't get it together until we were asked to quiet down. Wadim if you hang out here, send me a mail or something!

dekhn

I took a class taught by David Huffman (of Huffman coding) called Cybernetics (IIUC it was the UCSC equivalent of a class Wiener taught at MIT.

The very first day, he started out by talking about kissing spheres and concluded the lecture with "and that's why kissing spheres are easy in 7 dimensions" (or something like that).

Every lecture of his was like being placed in front of a window looking upon a wonderful new world, incomprehensible at first, but slowly becoming more and more clear as he explained. Sometimes I wish I could play in the garden of math.

rpigab

> Mathematicians often visualize this problem in terms of spheres. You can think of each code word as a high-dimensional point at the center of a sphere. If an error-filled message (when represented as a high-dimensional point) lives inside a given sphere, you know that the code word at the sphere’s center was the intended message. You don’t want these spheres to overlap — otherwise, a received message might be interpreted in more than one way. But the spheres shouldn’t be too far apart, either. Packing the spheres tightly means you can communicate more efficiently.

I went to math prep school for 2 years, attended 12 hours of math class in agebra and analysis per week, which I think proves I've done more math than most people in the general population, and this makes no sense to me. It either lacks introduction required to understand the analogy, or I've become really dumb. I want to understand this based on what the article says, but I can't. I can't represent error-filled messages as high-dimensional points. It's easier for me to imagine what the intersection between 4D spheres would look like in geometry.

I found this for anyone interested in understanding 4D spheres without knowing too much math: https://baileysnyder.com/interactive-4d/4d-spheres/

sohkamyung

I'm not sure if this helps makes things clearer, but see this diagram for symbols in Quadrature Amplitude Modulation [1]. The valid symbols are mapped to certain points in the vector space. Now, imagine non-overlapping circles around each symbol. If a received signal falls within a circle, it would be mapped to that symbol in the centre of the circle.

This can be extended to 3-D or higher dimension spaces.

[1] https://en.wikipedia.org/wiki/Quadrature_amplitude_modulatio...

quuxplusone

> I want to understand this based on what the article says, but I can't. I can't represent error-filled messages as high-dimensional points.

Well, start with an analogy. Let's say you and I want to communicate a message, which comes from a set of let's say 4 possible messages: "YES", "NO", "GOOD", and "BYE". Let's further suppose that the medium for this message (the "data channel") is going to be a single point selected from a 2D square. We'll agree beforehand on four points in the square that will represent our four possible messages. Then, you're going to position a dot at one of those points, and I'm going to observe that dot and infer your message from its position.

If the "data channel" is "error-free" (a.k.a. "lossless"), then it really doesn't matter which points we agree on: you could say that the exact center of the square is "YES", the point one millimeter to the left is "NO", the point two millimeters to the left is "GOOD", and so on. But if the data channel is "lossy," then the dot might get shaken around before I observe it. Or equivalently, I might observe its position slightly incorrectly. So we should choose our "code" so as to minimize the effect of this "error."

The best way to do that, on a square, is to place our four "code points" all the way at the four corners of the square, as far away from each other as possible. By "as far away from each other as possible," I mean in the sense of https://en.wikipedia.org/wiki/Pole_of_inaccessibility — I mean we want to maximize the minimum distance between any two points. A mathematician would notice that this is the same thing as maximizing the radius R such that we can draw a circle of radius R around each of our code points without any of the circles intersecting. (R in this case is half of the square's side length.)

If we add a fifth code point, this same reasoning would lead us to place that fifth point right smack in the center of the square. And the sixth point... well, I feel like that gets tricky.

BUT! In actual communications, we don't send messages encoded as real points in 2D squares. We send messages as discrete bit-strings, i.e., strings of zeros-and-ones of length N, which you can see as discrete points at the corners of an N-dimensional hypercube. Then, if we want to send K different messages robust against errors(+), we should pick as our code points some K corners of the hypercube so as to maximize the minimum Manhattan distance along the hypercube's edges between any two code points. This is the basic idea behind error-correcting codes.

A digital error-correcting code is "K code points in a bounded region of N-dimensional hyperspace (namely the discrete set of corners of a unit hypercube), selected so as to maximize the minimum distance between any two of them." The kissing-hyperspheres problem is "K sphere-centers in a bounded region of N-dimensional hyperspace (namely the continuous set of points at unit distance from the origin), selected so as to maximize the minimum distance between any two of them (and then, if that minimum distance is still >=1, increase K and try again)."

If all you meant is "Those two problem statements don't seem 100% equivalent," I think I agree with you. But if you meant you didn't see the similarity at all... well, I hope this helped.

https://en.wikipedia.org/wiki/Pole_of_inaccessibility

https://en.wikipedia.org/wiki/Error_correction_code

(+) — edited to add: Robust against the traditional model of error, i.e., our "threat model" is that any given bit has a constant small probability of getting flipped, so that our observed point may be some random Manhattan walk away from the code point you actually sent. You could instead use a different threat model — e.g. supposing that the bits sent in the actual digital message's "low-order" bits would flip more often than the high-order bits — in which case the optimal selection of code points wouldn't be as simple as "just maximize Manhattan distance."

rpigab

This helped a lot, thanks! I now see a similiarity where I was missing the bridges between geometry and lossy information channels. It's really interesting, though it's a really complex problem.

kazinator

Mathematicians may have discovered it, but it will take an engineer or experimental physicist to capture a trace on the osculoscope.

zython

two best spheres in a room; they might kiss :^)

TaurenHunter

https://knowyourmeme.com/memes/now-kiss

NooneAtAll3

if we believe the article, Li did all the work

and yet Cohn is first on the author list :(

ratmice

Alphabetical order is normal.

http://www.ams.org/learning-careers/leaders/CultureStatement...