Abstract
Single Input Multiple Data (SIMD) provides data parallelism execution via implemented SIMD instructions and registers. Intel's SSE Family Instructions are relatively well understood among C/C++ programmers hence implementing SIMD enhances software library. However, the barriers to entry Programming for ARM NEON has been made difficult to many programmers who only understand the Intel SSE family of compiler intrinsic. Since ARM does not share a common API to target. The cost of development is compounded since ARM compiler intrinsic and Intel Compiler Intrinsic incomparable forcing multi-platform optimizations to be have to be developed separately. Though the creation of an Open Source implementation of Intel's SSE Family API using ARM NEON compiler intrinsic developers will be able to cut costs and increase software stability though the use of a single standardized API.