01
Lolin D32
For the brains I used the Lolin D32. This was simply because
I still had thisboard laying around. If anyone would try to
work the same project, I would recommend them either a similar esp32
board or a board with more RAM. The Specs were enough for what I
needed to do but it's tight.
02
Microphone & Speaker
In order for the Porygon to be able to listen to what I am saying
I needed a microphone. The INMP441 is often used and was readily
available. Most voice assistants are also able to talk back. To
achieve this I chose the MAX98357. This allowed me to use 4-8 ohm
speakers, which I still had laying around.
03
Edge Impulse
There were a couple of options for wake word detection. These
options include ESP-SR, Streaming it, Locally transcribing or
Edge Impulse. Edge Impulse took the most time to install but was
also best tuned for my application.