Grabbing the custom Sum Op from first session, add vectorization dispatch and subtensor rewrite for "scalarization"