However, the last two terms seem designed to prevent any human collaboration
Interesting take...
If that was indeed the intent, I would expect them to oppose publishing
any modifications, which isn't specifically prohibited even though modifications are mentioned as being allowed/expected.
My interpretation of the last phrase is to prevent misrepresenting eg. a modification as coming from Apple, and/or Apple being responsible for it; this topic of not mis-representing your own work as coming from the licensor is common in developer agreements.
I suppose the term "benchmark" may be ambiguous. Although one might lose in court, I think the public would consider publishing a bug fix or new feature, which permits "x" to function in some way that is functionally different from the standard, isn't the same as a benchmark result, which would typically be regarding some kind of 'performance' as a metric. I think this would be worded differently if they meant that one should not discuss the effect of a modification in general.
My impression from the wording is that they don't want 'embarrassing' comparisons of how the code could have been improved in speed or size. I can imagine this coming up in the boardroom where they are discussing the pros and cons of releasing old source code. Is finding an easy fix to a bug the same thing? Maybe, maybe not.
So my current thinking is that I would not be reluctant to publish details of a modification to fix a bug or add a feature, assuming it is done without re-publishing the source code.
Obviously we ought to sort out this issue as best we can, so further thoughts, opinions, and discussion would be helpful.
Ancillary thought: AFAIK, Lisa and the Lisa Logo are not actually trademarks. In general, they aren't accompanied by the informal trademark symbol "TM" or the registered trademark symbol "circle-R", although perhaps they are in some isolated literature.