Code design

General framework

The program currently contains 7 basic moduli:

  • dem: ingradients of DEM, including DEM solver, contact solver, contact models, and etc.
  • domain: defines the computational domain
  • input: used to interpret the input file. It is planned to adopt the json format as the data exchange format. Each json object corresponds to a command, and the simulation is constructed or modified through the execution of the command. The commands in plan definition 6: create, get, set, fix, unfix, run.
    • create: create objects such as shape, particle, wall, contact model, modifier, etc.
    • get: get object properties
    • set: set object properties
    • fix: Bind the defined modifier to simulation to activate.
    • unfix: Unbind the defined evalua to make it invalid.
    • run: start discrete element calculation
  • modifier: defines some personalized calculations of the discrete element model, such as adding external forces, exporting data, etc. Because these personalized calculations are not required for every calculation model, they can be added to the model freely through this subscription method. Generally, the modifier is defined first, and then attached to a simulation and activated. When the discrete element is calculated to a certain node, the subscribed modifier will be called uniformly. Currently, two modifiers are defined: pre_modifier and post_modifier. pre_modifier is called at the beginning of each calculation cycle, and post_modifier is called at the end of each calculation cycle.
  • mpi: data interaction between different parallel computing cores
  • scene: contains particle, wall, contact model, modifier, etc.
  • shape: defines the shape tmplates, including sphere, ellipsoid, cylinder, spherical harmonics, polyhedron, level set, and etc.
  • utils: other tool functions

Other moduli (as of Dec. 26, 2021):

  • mlpack: wrapper of mlpack for machine learning exploration
  • peridigm: wrapper of peridigm for particle breakage analysis
  • pybind: for python interfaces

Basic elements

  • Particles:

    • Particles have variables or attributes such as shape, position, speed, and force
    • The movement of particles obeys Newton's law of motion
    • The particles are in contact with each other to generate contact force, which is calculated according to the corresponding contact model
  • Wall or boundary:

    • The boundary has variables or attributes such as shape, position, and force
    • The boundary is generally assumed to be massless, so there is no need to calculate - Newtonian motion, just update its position according to the preset motion law
  • Contact:

    • particle-to-particle, or particle-to-boundary contact
    • with contact geometric characteristics, contact force and other attributes
    • The contact model describes the relationship between contact force and contact geometric characteristics

Calculation procedures

The basic calculation process of discrete element (DEMSolver calculation process):

  • Traverse all the "particles" in this calculation domain, and clear the force and torque to zero.
  • Perform pre_modifier, such as gravity.
  • Traverse all the "particles" in this computational domain to determine whether its contours touch other computational domains: if so, use MPI to transfer its data to the computational domain, and create a new particle in the target computational domain based on the particle data, as The particle is a proxy in the target computing domain, and the proxy particle can come into contact with other particles. The particles in this computing domain are called "particle agents to be sent", and the particles in the target computing domain are called "received particle agents".
  • Receive the "particle agent" data and create an instance of the particle agent:
    • If this calculation domain already contains the id of the particle agent, find the particle pointer through particlem_map, and update the particle instance with the received particle agent data;
    • If the id of the particle agent does not exist, create a new "particle_ghost" instance, and update the particle instance with the received particle agent data;
  • Traverse the "granular agent to be sent" and send its corresponding "contact data" to the target computing domain.
  • Receive contact data and create contact instances:
    • Determine the pointer of the particle according to the particle id, and determine the pointer of the contact model according to the contact model id
    • Determine whether the contact exists through the contact_pp_list of the particle
      • If it exists, update the contact instance from the received contact data.
      • If it does not exist, establish a contact first, and use the received particle agent data to update the particle instance,
  • Traverse all contacts and set their updated to false.
  • Divide the computational domain into grids, traverse all the "particles" and "received particle agents" in the computational domain, and classify them into each grid. The classification criteria are: If the circumscribed cube of the particle or particle agent contacts a certain grid, then the particle or particle agent is added to the grid.
  • Traverse all grids:
    • Traverse the particles and particle agents in this grid to determine whether they are in contact. If touched:
      • Determine whether it was in contact at the previous time step
        • If yes, find the contact instance, update its contact geometric features and contact force, and set the updated of the contact to true
        • If not, create a new contact instance, initialize the contact geometric features and calculate the contact force, and set the updated of the contact to true
      • Apply contact force to particles and boundaries
  • Traverse the "received particle agent":
    • Traverse the contact of the particle agent, if the updated state of the contact is true, use MPI to send the contact data back to the original computing domain
    • Receive contacts returned from other computing domains and reconstruct contact instances
      • If the contact already exists, use the returned data to update the original contact
      • If the contact does not exist, create a new contact and rebuild the particle pointer in the contact
        • If the particle exists in the computational domain, the particle pointer is found through particle_map and particle id.
        • If the particle does not exist in the computational domain, create a new "particle illusion" and set the "particle illusion" pointer to the contact.
    • Apply contact force to particles
  • Traverse all contacts
    • If its updated status is false, delete the contact;
  • Traverse all particles - Update its speed, position and other attributes according to Newton's law of motion
  • Add the partial_proxy_list to the particle_ghost_list and clear the partial_proxy_list
  • According to the updated particle position, traverse each particle to determine whether it exceeds the calculation domain:
    • If so, send its data to the new computational domain of the particle, and move the particle to particle_ghost_list.
  • Receive the "particle" data and create an instance of the particle:
    • If this calculation domain already contains the id of the particle, find the particle pointer through particlem_map, and update the particle instance with the received particle data;
    • If the id of the particle does not exist, create a new particle instance and update the particle instance with the received particle data;
  • Traverse particle_ghost_list
    • If it is not in contact with any particles or boundaries, delete the "particle phantom"
  • Perform post_modifier, such as data output and other functions.

Coding style

We generally follow Google C++ Style Guide.

Naming

  • Filenames: lowercase words connected by underscores, e.g. particle.hpp, contact_pp.cpp.
  • Variable names: lowercase words connected by underscores, e.g. dir_n.
  • Class and struct names: words with first letters capitalized, e.g. DataDumper. Word following abbreviations shoudl be capitalized accordingly, e.g., GJKsolver, DEMgjk, GJKdemGJK, etc.
  • Macros: should be capital, such as PI.

Comment

  • Comment is not a requisite, but please add it if an attribute or method is not self-explainary or is ambigious.
  • We use doxygen to generate the code documentation. We suggest the following comment format.

    • Block documentation (e.g., for class description):

      cpp /** descriptions */

    • Line documentation:

      cpp /// descriptions

    • Other commands if approperiate: @warning {warning message}, @todo {things to be done}, @bug , @brief, @var、@enum、@struct、@class、

Formatting

Programing rules

  • Use auto for local variables when appropriate.
  • Mark const when appropriate.
  • Reference vs. pointer:
    • If a variable will not be altered after calling the function, use reference with const mark, e.g., const double &[variable].
    • If a variable will be altered, use pointer. For int or double, as well as lists of int or double, mark with const (e.g., double *const [variable]) to aboid mistakenly modifying the pointer.
    • If a variable will not be altered but its value will be passed and stored the calling instance, use pointer.
  • Following the previous item, if you are going to modify a variable, please declare it or passing it as an argument with with &.
  • Prefer use c++ std library rather than c library, e.g., use <cmath> rather than <math.h>.
  • Avoid using smart pointers, such as std::unique_ptr, std::shared_ptr.
  • Never ever use "using" (e.g., using namespace std) in headers.

Performance evaluation

Procedures

  • Tool: linux perf

  • To probe the performance:

    sudo perf stat build/bin/tmp_debug
    

    Example performance counter stats:

         12,639.97 msec task-clock                #    0.983 CPUs utilized
             1,284      context-switches          #    0.102 K/sec
                42      cpu-migrations            #    0.003 K/sec
            13,356      page-faults               #    0.001 M/sec
    26,566,855,696      cycles                    #    2.102 GHz
    56,713,820,846      instructions              #    2.13  insn per cycle
    10,275,875,523      branches                  #  812.967 M/sec
        31,362,265      branch-misses             #    0.31% of all branches
    
      12.857522834 seconds time elapsed
      11.472000000 seconds user
       1.120000000 seconds sys
    
  • To sample CPU usage:

    sudo perf record -e task-clock -g build/bin/tmp_debug
    sudo perf report -i perf.data
    
  • To generate heat map and visual:

    git clone --depth 1 https://github.com/brendangregg/FlameGraph.git
    FlameGraph/stackcollapse-perf.pl out.perf > out.folded
    FlameGraph/flamegraph.pl out.folded > out.svg
    sudo perf script > out.perf  
    # then upload out.perf to https://www.speedscope.app/ to visual