I had a similar idea, taking a page from the Propeller chip. Would it be possible to have two cores with some shared memory, or perhaps a special bus to communicate between them? That might make the product more interesting, if the execution unit could run two separate data streams at twice the speed of a normal Arduino. Having separate state registers for each stream would be the trick.
Like the Parallax chip, you could have one core operate as a peripheral.
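
To make the idea a bit more concrete, here is a rough toy model in Python. It is only a sketch of the concept, not how an AVR or the Propeller actually works: the instruction names (`ADDI`, `LOAD`, `STORE`, `JZ`, `HALT`), the `Context` class and the two-slot shared memory are all made up for illustration. One execution unit alternates between two streams every cycle, each stream has its own state (program counter and accumulator), and the second stream behaves like a simple peripheral that waits for a value in shared memory and answers it.

```python
from dataclasses import dataclass


@dataclass
class Context:
    """Private state of one stream: its own program counter and accumulator."""
    program: list
    pc: int = 0
    acc: int = 0
    halted: bool = False


def step(ctx, shared):
    """Run one instruction of ctx on the single shared execution unit."""
    if ctx.halted:
        return  # this stream's time slot is simply wasted
    op, arg = ctx.program[ctx.pc]
    ctx.pc += 1
    if op == "ADDI":       # acc <- acc + arg
        ctx.acc += arg
    elif op == "LOAD":     # acc <- shared[arg]
        ctx.acc = shared[arg]
    elif op == "STORE":    # shared[arg] <- acc
        shared[arg] = ctx.acc
    elif op == "JZ":       # jump to arg if acc is zero
        if ctx.acc == 0:
            ctx.pc = arg
    elif op == "HALT":
        ctx.halted = True


def run(contexts, shared, max_cycles=100):
    """Alternate between the streams, one instruction per cycle."""
    cycle = 0
    while cycle < max_cycles and not all(c.halted for c in contexts):
        step(contexts[cycle % len(contexts)], shared)
        cycle += 1
    return cycle


# Stream 0: the "main" program writes a request into shared[0].
main_prog = [("ADDI", 5), ("STORE", 0), ("HALT", None)]

# Stream 1: the "peripheral" polls shared[0] until it is non-zero,
# then adds 10 and writes the reply into shared[1].
periph_prog = [("LOAD", 0), ("JZ", 0), ("ADDI", 10), ("STORE", 1), ("HALT", None)]

shared = [0, 0]
contexts = [Context(main_prog), Context(periph_prog)]
run(contexts, shared)
print(shared)
```

If the execution unit is clocked at twice the normal rate, each stream still gets one instruction per normal clock period, which is the "two streams at twice the speed" part. The only hardware that has to be duplicated in this model is what sits in `Context`.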