We have implemented data-parallel primitives on the hybrid dataflow/von Neumann parallel computer EM-4. To evaluate the performance of these primitives, we compare them to the identical primitives running on the CM-200 SIMD parallel computer. For integer arithmetic element-wise operations, EM-4 is faster than the CM-200 when two or more operations can be combined. For communication operations, EM-4 has significantly higher performance. EM-4's distinguishing feature in running data-parallel codes is its exceptional communication performance in terms of its network interface. Additional special-purpose hardware for barrier synchronization and scan-like operations is not necessary. Dataflow-style token synchronization is helpful, but not necessary, in implementing data-parallel primitives.