Skip to content(if available)orjump to list(if available)

The Sky's the limit: AI automation on Mac

AnonC

One doesn’t need to wonder why Apple couldn’t get something done. Apple seems to be in dire need of a shakeup at the top level, but seems to be incapable of seeing the need for that or being able to do that.

From the outside, it looks like there are just power struggles and fiefdoms held by the old guard that have resulted in the stagnation and the worsening quality across its operating systems. Even today you cannot search and find all the things in the Settings app on iOS, which has been a longstanding issue. ScreenTime is totally broken since iOS 18 and shows tons of minutes of apps that I open for just one minute everyday. There are many more irritating old bugs as well as new bugs being added regularly.

The recently rumored to be forthcoming “26” numbering of all its operating systems and the cementing of annual releases with new feature addition doesn’t bode well for improving quality. The way Apple’s software teams have been working, a tick-tock cycle of improvements followed by stabilizing every other year is the only way things can get a little better. That’s a pipe dream for me anyway.

cosmic_cheese

I’ve seen arguments that it largely comes down to the lack of a Jobs-like figure at the top maintaining a coherent, user-driven vision across the company and won’t hesitate to take a hatchet to projects and initiatives that aren’t furthering that vision.

Cook is excellent in terms of managing supply chains and has been good for the company’s bottom line, but he doesn’t have Jobs’ capability to discern what users want/need and what makes a good product, and I doubt he has anywhere near the level of direct involvement that Jobs had.

That’s not to glaze Jobs, he was anything but perfect but he was undoubtedly among the driving forces behind Apple’s quality and knack for building things that people want.

thenthenthen

I am a bit late to the (upgrade) party, but ios 18… what happened?! If you scroll through dates in the Calendar app, random date numbers vanish. Huh. Search in Notes is super weird (shows old, fuzzy matches first). Photos app is just.. wow where am I? How is there no jailbreak yet? (Is there?). I have read here the main issue is fragmentation, but I am not sure how that (not) works. If one team builds, let say Notes.app why is it so broken? Because it recycles search libs from (broken) Photos.app? Sorry for the rant.

Edit: is this to upsell the AI?

jmathai

Thanks for mentioning this. I didn’t connect my general confusion in the Photos app with an upgrade but it makes sense.

The navigation is so unintuitive to me that I’ve killed the app just to try and get back to the most recent photos I’ve taken.

Apple products have mostly been very good. Even when I didn’t love the UX I could figure it out.

So many recent features just feel half baked (like replies in iMessage). I’m starting to wonder if they’re better off leaving good enough alone (not that that is a remote possibility).

bronlund

Yeah! macOS gets more and more like Windows by the minute. I dread the day, in about 700 years, when you can hardly tell the difference anymore.

bee_rider

I think Apple will have trouble course-correcting because there currently isn’t a good OS for sale. Why improve in terms of quality, if there isn’t any competition?

Weirdly, I think they might be smarter than us. Despite being superficially uninterested in hardware, the general public seems to buy hardware, and then just use whatever OS comes on it. Apple seems to have aligned with this by easing up on software quality a bit and obtaining a reasonable (although not incontestable) claim at being the best hardware company with their M-series chips.

oneneptune

I would wager sky.app will find themselves acquired by Apple before the end of 2026.

rcarmo

Again.

VenturingVole

Oh wow, clicking on their "Join Team" button is absolutely amazing. Credit to them.. almost makes me wish I was a Mac developer to apply for a job. Almost.

https://software.inc/jobs

thebruce87m

This gives me a black screen on iOS safari

Edit: working now

daveidol

seems like this is an actual emulator not just a JS approximation

krackers

So snappy, it's almost faster to mac os 8 in a browser than it is to use mac osx.

BryantD

For those who want a better explanation of Sky's capabilities, I recommend Federico Viticci's article (linked from the Tao of Mac article but the links are not distinctive): https://www.macstories.net/stories/sky-for-mac-preview

He goes fairly deep based on two weeks of usage.

rcarmo

People complained about the links being too visible, so they are only visible once per article now, and I make sure to highlight them on hover over a paragraph. I guess I will never please everyone :)

BryantD

Ha! yeah, you can't make people happy all the time. And I mean, I found the link, so I can't complain too much.

ed

Quicksilver[1], app switcher [2]… Sherlock[3].

Now that Apple knows what to build, they’ll build it.

1. https://en.m.wikipedia.org/wiki/Quicksilver_(software)

2. https://www.folklore.org/Switcher.html

3. https://en.m.wikipedia.org/wiki/Sherlock_(software)#Accusati...

smcameron

Oh, weird, I was just thinking of Memory Shift[1] earlier this morning, hadn't thought of it in probably 30 years, and now that switcher article mentions it.

[1]https://www.atarimagazines.com/creative/v9n12/82_MemoryShift...

dangoor

Or maybe they'll buy it, like they did with Workflow.

sajithdilshan

This is what Apple Intelligence should have been. Instead we got Genmoji

break_the_bank

Shameless plug here - we are working in a similar space, and have started at the copy & paste layer. We are calling it TabTabTab[0]; while we have a waitlist as we are working on our onboarding, you can already give it a go on[1]

The main actions are option+c to copy, option+v to paste. Take the front page of hackernews and capture it with option+c (use cmd+a to select all, if text is selected we pickup text, otherwise we pickup an image); and then go to sheets.new or numbers and hit option+v, see your Spreadsheet getting built. It works everywhere.

There's spells option+s (custom GPT per app / URL) and agent mode (option+I) but give magic copy/paste a shot. More examples here[2]

[0] https://tabtabtab.ai/

[1] https://tabtabtab.ai/TabTabTab.dmg

[2] https://tabtabtab.notion.site/tabtabtab-user-notes?pvs=4

bronlund

I like shameless plugs that are honest about it. I'll surely take a look :)

break_the_bank

I will email you with the link, we don't have that bit automated. Here it is in the meantime - https://tabtabtab.ai/tabtabtab.dmg

skybrian

It seems like a power tool for early adopters who are willing to take a few risks and something safe for the masses are two different things?

Apple has a target on its back. Anything they do absolutely will be hacked. There are unsolved problems securing tool-using LLM’s.

I wonder what Sky does for security?

daft_pink

It’s really obvious that AI is something that is very rough around the edges and companies known for delivering polished projects are having a lot of trouble delivering. For example, Google should have been the leader in AI, but I think OpenAI got first mover advantage, because they didn’t want to ship a product that hallucinates. If you look at the pre-ChatGPT days, all the news stories were what happens when AI goes wrong not forgiving the failures and accepting that in some areas it can get things really right in a more efficient way.

K0balt

I’m interested in something in this space that can work with local models. I absolutely do not trust inference providers with my data without very careful risk analysis.

bigtones

Sky honestly does not even look that impressive from the video on their website. So it can save calendar events if you type something into a prompt. Wow....

What am I missing ?

sgt101

My brother, you haven't drunk the right coolaid.

It's not impressive because text is a shit interface for interacting with a machine. GUI's are much better and we established this in like 1982.

Also what they are doing in that demo is inserting a machine shim in an interaction between people, which is what people, time and again, say they do not want.

dawnerd

Doesn’t macOS already have add to calendar when it supports dates? I too was confused why they used that in particular. Nothing they Showed made me go wow that’ll save me time. And what makes any of that “ai”?

It really seems like a solution in search of a problem.

rcarmo

It is inferring the UI content. There's a bit more to it than it looks at first glance.

krackers

What does "inferring the UI content mean". You said you have some knowledge of how it pulls it off, is it using accessibility API and apple events behind the scenes? Those are the only two APIs I know of on mac that can do this (besides just grabbing the pixels of a window).

I've always been hoping that the rise of LLMs leads to a resurgence in apps being able to respond to apple events, since it seems like a perfect fit.

ndr42

To ask it to organize the download folder seems great - more examples: https://www.macstories.net/stories/sky-for-mac-preview/

skydhash

Nothing really! Saving one or two clicks for people that can't bother to learn any UI. The true automation I needed which leads me to leave macOS is workspace arrangement. Now I'm using Sway on Linux and I now that Super+1 is Emacs, Super+2 is the browser, Super+3 is where the terminals are,... Typing the same shortcut again brings me in the last workspace. And no sloooooow animations in the way.

IggleSniggle

This stuff is readily available in the macOS ecosystem, but I don't fault anyone for jumping ship for these reasons. It's the same reason I did, before I figured out how to do it in macOS land by way of being required to work on macOS anyway.

skydhash

I know how to them in macOS, it's just not pleasant. I still use macOS, but for very specific workflows. My daily computing is on Linux where I got everything to behave how I like.

1123581321

You can set all that up on macOS and disable animations.

rcarmo

It's not about the app, it's about it existing outside of Apple and their push for Apple Intelligence.

throwaway314155

I mean the article claims that it's good - better than all other existing solutions, and hence why it's worth talking about in the first place.

rcarmo

It is _much_ better than all the other MCP servers I tried for the Mac so far (I've been doing one myself and know a lot about Mac automation, so I've benchmarked them, and none can do the kind of UI analysis I see in the demos).

break_the_bank

i am building in a similar space, while i see where you are coming from I am extremely curious on what "a mind blowing" or "i need that" experience looks like to you from a desktop AI product

cypherpunks01

Seems that it can also text your friends (so you don't have to)

null

[deleted]

yreg

Unrelated to the article content:

The styles on this blog overwrite repeated links with the color of the main text (see `.duplicate`). What's the design idea behind this? I doubt it holds, since the links are still there for keyboard users, etc.

Either make your hyperlinks visibly distinct (ideally underline them) or remove them.

rcarmo

I did that because people complained that the links were too visible (because the site is actually a Wiki, there are often several copies of the same link throughout the text).

Over the past 20 years there have been many schools of thought about it, and the only way I could get people to stop kvetching about it was to have the links become more distinct on paragraph hover but still render them in a distinct color (and oddly enough, the rate of complaints went way down when I shifted to dark red).

I have no sane explanation for this other than ascribing it to user-driven UX research over a massive amount of time (when compared to most other websites I know).

But (fun fact) the duplicate links inside paragraphs do wonders for RAG chunking and vectorization. :)

yreg

Ha, did you just update the styles based on my complaints? :)

I believe there are multiple reasons why the styles are better now. Even the first-use red links are much more accessible now. And you are now presenting the same content to the users using screen readers or other assistive technologies as for the typical users.

By the way, I think your analysis of the Apple situation sounds absolutely spot on.

rcarmo

And I did that with just plain HTML, which is madness in today’s web :)

bronlund

If you have already been there, why would you go again?

yreg

No human visitor opens all of the links from top to the bottom, so the repeat occurrence might very well be unvisited. After all, we have `:visited` for that.

But even if you are questioning why anyone would click it -> well why have the link there in the first place then?

guestbest

I’m ok with owning a Mac if it only means that AI tools don’t have root access to the machine.