I have data items 1,2,3 ..... n I need to run a sequential calculation with all the data items. the value of n is very large, about 600,000 or more. the data is taken from a text file that is usually more than 2GB in size
I have java program to perform the calculation in a loop.The processing time usually takes more than 24 hours. I need to use clusters to minimize the processing time and to distribute the job to different cluster nodes.
currently i am performing parallel processing in my local computer with 4 cpu cores. The work is spitted into pieces and given to 4 cores. when one core finishes a piece of the work, next pieces is loaded. So, there will be a queue and 4 cores processed the queue in parallel.
Which cluster application is good for java in the enterprise level ? Do i need to change my program code? Does the cluster program take care without modifying the java code? How can i split the job and distribute the job to different clusters? Do i need to upload data file to all the cluster nodes?
I will be greatly thankful to your help.