...
To alleviate this imbalance, OpenFF will provision its own host and deploy QCFractal Server to it for future calculations. This will also give OpenFF operational independence in dataset lifecycle and management, allowing the organization to adjust its usage patterns as needed to meet its needs.
Server
...
Requirements
A recommendation for server specifications for an OpenFF QCFractal Server is as follows:
chassis : Dell EMC PowerEdge R7525 Rack Server (or HPE equivalent)
motherboard : PowerEdge R7525 Motherboard, with 2 x 1Gb Onboard LOM,MLK V2 (or HPE equivalent)
cpu : 2 x AMD 7543 2.8GHz,32C/64T,256M,225W,3200
this could likely be cut in half to 2 x 16 cores == 32 cores
memory : 16 x 32GB RDIMM, 3200MT/s, Dual Rank 16Gb BASE x8
total memory: 512GB
storage : 24 x 7.68TB SSD SAS ISE Read Intensive 12Gbps 512e 2.5in Hot-Plug, AG Drive
we won’t need this much storage, which comes to 92TB of usable RAID10 storage
we could get away with at minimum 1/4 of this, or 23TB of usable RAID10 storage
we would target 50% utilization of storage at all times, setting dataset lifecycle / retention policy accordingly
This recommendation draws from Ben Pritchard’s experience in managing the MolSSI QCFractal Server for over two years, as well as from his own testing cycles of the upcoming next
refactor of QCFractal.
Network Requirements
The server will need to be internet-accessible via the following ports:
http : 80
https : 443
ssh : 22
The server should be reachable via qcfractal.openforcefield.org
.
Backup Requirements
We will set up automated backups, full + incremental. These will need to be shipped to a remote resource, though offsite is not critical. An NFS or SMB/CIFS mounted network filesystem would be ideal. We will likely leverage pgBackRest
(https://pgbackrest.org/ ) for backup generation.
Maximum backup storage allocation should at least equal the working space of the production host. If the production host has 23TB of working storage space, then the backup solution should have ~24TB of space. This will allow for some accumulation of incremental backups over time as storage utilization rises.