I made a 10¢ MCU Talk
31 comments
·October 29, 2025NoiseBert69
MisterTea
Nothing irks me more than "check out my neat-o PCB design" and there's no schematic.
amoose136
It's kicad which means you can use kicanvas to view it. For example: https://kicanvas.org/?github=https%3A%2F%2Fgithub.com%2Fh0la...
jimmyswimmy
What an amazing tool: it loads and displays the kicad native files in a simple web browser, but moreover - you don't have to be the intermediary, cloning some repo and then uploading the individual files to a website.
I wish that existed for more weird binary formats. Altium have 365 but you have to have signins to use it, and they cost named-user seats.
MisterTea
I use KiCAD so I know it well. Unfortunately that site is in alpha and I am unable to zoom in to see the schematic clearly.
NoiseBert69
Oh.. I love this web tool. Thanks for showing!
NoiseBert69
Feel free to have a look at the "pcb" folder.
MisterTea
Sorry, but I am not installing KiCAD or cloning a repo just to look at a schematic. Since the beginning of time electronics hobbyists have been posting schematics in bitmap or pdf format. It should be in the readme.
amelius
Can you change the polarity (direction) of the DC motors with this board?
NoiseBert69
Yes. The DRV8837 has a pin for the direction. It's a H-Bridge.
kragen
It's probably worth mentioning the 2400bps (300 bytes per second) LPC10 codec built into SoX. If you have SoX installed, try
rec -t lpc10 speech.lpc
and then speaking into your microphone for ten or fifteen seconds before you ^C it. Then play it back with play speech.lpc
It will sound very robotic but pretty comprehensible, at least with an adult male voice in English, and it preserves a lot of the prosody and enunciation that is so hard to get out of speech-synthesis packages.12KiB of data at 300 bytes per second would be 41 seconds of recorded speech.
Decoding the LPC10 data on the CH32V003 might be tricky. On amd64, running `make CFLAGS=-Os` followed by `ld -r -o tmp.o *.o` inside sox-14.4.2+git20190427/lpc10 yields a tmp.o with 25243 bytes of text (including .rodata, etc.) and 356 bytes of data. I'm not optimistic that RISC-V would compress that to fit inside the CH32's flash. And I find the code in that directory inscrutable; it's Fortran that's been compiled to C.
Still, it seems plausible that you could massage the LPC10 data into a format that something like Talkie would understand.
thrtythreeforty
This is great. Missed opportunity for a low-pass RC filter on the speaker circuit - if you know you're driving an 8kHz sample rate, you can design your filter with that cutoff, and it'll sound way better (it'll get rid of the buzzy quality).
kragen
This may be essential if you're connecting it to an audio amplifier. I learned this the hard way by burning out someone else's very expensive tweeters with 31.25kHz PWM.
pjc50
Nice work. Especially referencing the TI prior art of the Speak and Spell. This kind of synthesis was quite prevalent in the early 80s - school BBC Micros had a ROM which let you "*SAY" a phrase. Classic Macs had MacinTalk.
Another codec which might be interesting to try but is considerably more complicated is AMR, from GSM: https://en.wikipedia.org/wiki/Adaptive_Multi-Rate_audio_code...
ctoth
You should be able to do it all on-device, check out SAM, the Software Automatic Mouth. The actual data in the *_tabs files:
thomassmith65
The sound in the video seems more sophisticated than TTS. It seems more like the result of analyzing a clip of digital audio, and turning it into a series of TTS phonemes.
Assuming SAM is a faithful port of the original, it converts text into phonemes according to a bunch of pronunciation rules.
dlcarrier
Can you export the schematic and Gerber files to a PDF file? A lot of open source projects do this, and it makes it much easier to tell what's going on, with software pretty much everyone already has on their computer.
docdeek
Interesting, though before clicking I thought the headline might be referring to a very poorly paid presentation about Marvel movies.
pdntspa
Ha, seconded. I hate how acronyms get repeated between domains. Makes for very confusing reading if you're not part of the in-group.
zahlman
> I considered a few encoding options for compressing the audio.
The presentation of this part seems extremely padded out to me, ironically enough.
MisterTea
Could hang an i2c flash chip off that thing for more storage and still have enough IO pins for serial coms and a spare IO pin.
Findecanor
I saw another audio project on the same microcontroller (family) posted a few days ago: ModPlayRISCV It plays a tracker MOD. using PWM with a low-pass filter. It resamples/scales all samples at varying rate/volume into a ring buffer which gets fed to the PWM comparator by DMA.
null
colechristensen
First I thought you made a lecture on MCUs which was available for viewing in exchange for $0.10
Then I thought you made a lecture on MCUs where the device was available for purchase generally for $0.10.
Then I thought with an MCU valued at $0.10 you generated speech
English... sigh
SJC_Hacker
This is why I prefer μC
Although I guess that can also be confused with micro Couloumbs
SideburnsOfDoom
It has worse misreadings, it could be a cheap lecture on the "Marvel Cinematic Universe". (It's about cheap MicroController Units.)
These CH32 mikrocontrollers are great and dirt cheap. I've build a small DC motor controller with them to control a robot: https://github.com/h0lad/MiniSpeedController
The bigger ones have PHYs for USB HS, USB-C (5Gbps) and 10/100M Ethernet integrated (!). And their development environment (Mounriver Studio) isn't too bad - I didn't had the immediate urge to port everything to CMake/VSCode.
But they need some kind of pin planning tool. It's awful to use the datasheet and find the correct pin functionalities and their mutual exclusions... STM32 mastered this with their STM32CubeIDE tool: select a feature (like USART1) and the right pins light up - alternate pins are easy to locate.
They also should clean up their license mess on OpenWCH (their GitHub page). Lots (all?) of their HALs are Opensource - but the right version with right SPDX tags are often a bit hidden.