Abstract
This paper presents several versions of a signal extraction algorithm, belonging to the entry stage of the Cherenkov Telescope Array's Real Time Analysis pipeline, implemented and optimised using SSE2, POSIX threads and CUDA. The results of this proof of concept give insight into how suitable each platform is for this particular task and what performance each one can deliver.