Skip to content

Zardoz89/nBodySim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nBodySim : N Body problem Simulator

nBodySim is a small simulator programa capable of handling huge number of bodies (>1000). It's capable of using multiple cores of a modern many-core CPU. For this, nBodySim is programed using D Language 2, using the std.parallelism library.

This software was compiled with gdc 4.6.3 and dmd 2.0.65, and tested on some machines using Ubuntu x64 14.04, 12.04, 12.10 and Linux ROCKS 6.0 x64.

To build it, use DUB. So do a simple :

dub build=release

It will generate the program using parallel for. If you like to compare with the serial version, you can launch dub with --configure=nBodySim-serial

Usage

./nBodySim [OPTION]...
  • -i, --input=FILE Input file with initial positions and velocity of every body
  • -o, --output=FILE Output data file
  • -d, --deltat=NUM Delta of Time used in each step. By default is 86400 [s]
  • -n, --num=NUM Total number of steps. by default is 36500, that with default DeltaT, is 100 years of simulation
  • --points=NUM Defines every few points that are written to output file. 1 means that are used all points, 2 that 1/2 of the total points not are written. By default is 4
  • -e, --epsilon=NUM Softening factor used that avoid NaNs and Inf in calculations. By default is 1000 meters
  • --rand=NUM If not are using a input file, then generate NUM random stellar mass bodies in a cube of 10 LY
  • --seed=NUM Set a seed for the random generator
  • --center=NUM Set the output coords relative to the entity NUM Useful to centre around a particular entity, like the Sun
  • -h, --help Show this help

Note : If it is used without any option, will try to open ssolar.sim file

Examples

  • Execute a simulation of 100 years with a DeltaT of a 1 day of our solar system fixing the Sun in the (0,0,0) position ./nBodySim --input ssolar.sim --center 0 --output ssolar.out
  • Execute a simulation of 10000 years with a DeltaT of 100 days of a ranmdon set of 500 stars ./nBodySim --rand 500 -d8640000 --output rand.out

Output files

For the moment, nBodySim it's outputing files that are compatible with GnuPlot, where in a row we can found the position of every object in the simulation for a determinated moment ofthe simulation. It's easy to plot hte data using 'splot' command of gnuplot, but be carefull about how many points (rows) are stored in the file, becasue GnuPlot can be easily swamped. I recomend using --points to skip points to be stored in the output file, and keep it easy to hnadle for the gnuplot at same tiem that allow to run long (big number of iterations).

View output on gnuplot

View the output data of the simulation of our solar system

set zrange [-8e+12:8e12]
set title "Solar System : 100 years"
splot "ssolar.out" u 1:2:3 t "Sun", \
"ssolar.out" u 4:5:6 w dots t "Mercury", \
"ssolar.out" u 7:8:9 w dots t "Venus", \
"ssolar.out" u 10:11:12 w dots t "Earth", \
"ssolar.out" u 13:14:15 w dots t "Moon", \
"ssolar.out" u 16:17:18 w dots t "Mars", \
"ssolar.out" u 19:20:21 w dots t "Jupiter", \
"ssolar.out" u 22:23:24 w dots t "Saturn", \
"ssolar.out" u 25:26:27 w dots t "Uranus", \
"ssolar.out" u 28:29:30 w dots t "Neptune", \
"ssolar.out" u 31:32:33 w dots t "Ceres", \
"ssolar.out" u 34:35:36 w dots t "Pluto", \
"ssolar.out" u 37:38:39 w dots t "Haumea", \
"ssolar.out" u 40:41:42 w dots t "Makemake", \
"ssolar.out" u 43:44:45 w dots t "Eris"

TODO

  • Make a animated viewer of generated data.
  • Make a interactive mode that work with the viewer.
  • Add an option to use relative coords around the barycenter of the all system.
  • Implement collisions.
  • Add other output format that can carry more usefull information.
  • Add a way to do electroestatic simulations.
  • Add a way to use a arbitrary precision calculations (GMP).

Benchmark data

10 iterations :

FX-4100 quad core times :

Bodies : 1000
Serial = 573 ms
ParallelFor = 192 ms -> SpeedUp = 2.98

Bodies : 3000
Serial = 5106 ms
ParallelFor = 1745 ms -> SpeedUp = 2.92

Bodies : 10000
Serial = 57726 ms
ParallelFor = 20023 ms -> SpeedUp = 2.88
~72% of max teorical speedup

Opteron 16-core times :

Bodies : 1000
Serial =  682 ms
ParallelFor = 67 ms -> SpeedUp = 10,12

Bodies : 3000
Serial = 6083 ms
ParallelFor = 474 ms -> SpeedUp = 12,83

Bodies : 10000
Serial = 69728: ms
ParallelFor = 5232 ms -> SpeedUp = 13,32
~83.25% of max teorical speedup

FX-8370E overcloked at 3.6Ghz . dmd 2.090

Bodies : 1000
Serial = 119ms
ParallelFor = 19ms -> SpeedUp = 6,26

Bodies : 3000
Serial = 1078ms
ParallelFor = 168ms -> SpeedUp = 6,41

Bodies : 10000
Serial = 11945ms
ParallelFor = 1841ms -> SpeedUp = 6,48

R7 3700x

Bodies : 1000
Serial = 43ms
ParallelFor = 5ms

Bodies : 3000
Serial = 387ms
ParallelFor = 35ms

Bodies : 10000
Serial = 4320ms
ParallelFor = 376ms -> SpeedUp = 11.48
~71% of max teorical speedup

i7-6700 4 core 8 threads

Bodies : 1000
Serial = 212ms
ParallelFor = 39ms 

Bodies : 3000
Serial = 1868ms
ParallelFor = 354ms 

Bodies : 10000
Serial = 20745ms
ParallelFor = 3761ms -> SpeedUp = 5,52 
~69% of max teorical speedup

With 1000 iterations :

FX-4100 quad core times :

Bodies : 100
Serial = 571 ms
ParallelFor = 234 ms -> SpeedUp = 2,44

Bodies : 1000
Serial = 57373 ms
ParallelFor = 19987 ms -> SpeedUp = 2,87
~71.75% of max teorical speedup

Opteron 16-core times :

Bodies : 100
Serial = 676 ms
ParallelFor = 296 ms -> SpeedUp = 2,28

Bodies : 1000
Serial = 67505 ms
ParallelFor = 5921 ms -> SpeedUp = 11,40
~71.25% of max teorical speedup

FX-8370E overcloked at 3.6Ghz . dmd 2.090

Bodies : 100
Serial = 600ms
ParallelFor = 129ms -> SpeedUp = 4,651

Bodies : 1000
Serial = 60063ms
ParallelFor = 9314ms -> SpeedUp = 6,448
~80% of max teorical speedup

Bodies : 3000
ParallelFor = 83770ms

R7 3700x 8 core 16 threads

Bodies : 100
Serial = 220ms
ParallelFor = 42ms

Bodies : 1000
Serial = 21886ms
ParallelFor = 2087ms -> SpeedUp = 10.48

i7-6700 4 core 8 threads

Bodies : 100
Serial = 210ms 
ParallelFor = 60ms 

Bodies : 1000
Serial = 20706ms 
ParallelFor = 3852ms -> SpeedUp = 5.37
~67% of max teorical speedup