Oh for more spare time...
... code that plays an audio sample on a Lisa without a DAC?
I can barely imagine some of the wonderful things you would do if you had more spare time!
Not the same, but I think the simplest approach conceptually is copying pre-processed PWM data from a range of memory to the VIA.
eg. a series of MOVE (A0)+,(A1) instructions, where A0 points to a range of memory containing appropriate PWM data and A1 points to the VIA's PCR register which can set CB2 (the TONE output) high or low.
That instruction takes 16 cycles, and at 5 MHz, that means 3.2 us per write, making the maximum toggle rate about 156 KHz.
If you want hi-fi, then 1/4 of that rate could be sufficient, and by pre-loading 2MB of memory you might get 50 seconds of sound.
However, the pre-processing step seems too compute intensive to perform a real-time conversion from conventional samples of a hi-fi analog signal to PCM with hi-fi quality. Perhaps the PET example has some clever/efficient way to do it, or you can load the samples and wait while it is converted to PWM.
I suppose a custom build of the ESProFile software could allow continuously streaming (rather than sector by sector) PWM data from the parallel port to the speaker, but that's probably cheating.
Another approach is to use the Lisa's 3 bit level/volume control in conjunction with the PWM technique (at the cost of losing volume control

)
eg. use the polarity of a sample to set the TONE signal high or low, then apply 3 bits of the magnitude of the sample to the level control. That would make a 4 bit DAC with a pretty fast sample rate, perhaps adequate for recognisable speech, assuming your source file has the corresponding sample rate.
Or add a jumper (maybe it needs capacitor coupling) from the output of the 6 bit Contrast DAC to Spkr In, and you'll have a high sample rate, retain volume control, and bonus visual effects!
edit: changed PCM to PWM, although I suspect it doesn't matter in this case