Release Notes
Fsched 10.106
Updated: 2025.12.31
- slurm:
- Added license server monitoring and license allocation management for the current cluster
- Added adaptive scheduling
- Added the
AllowUsersparameter for partitions - Enhanced
fsched_list_job()to support more pagination options - Added QoS denial behavior for "by job" and "by user"
- Limited
wckeyinput length to 42 characters - Added the
checkpoint/criuplugin for job checkpoint and restore operations, supporting incremental checkpoints, pre-dump, and load-aware delays - Enhanced
cli_filterto support wrappers and the custom field API - Fixed issues:
- Fixed a problem where, if a user submitted a job without using
-cto specify CPU cores, then afterslurmctldrestarted the core count was incorrectly stored as0xfffe, causing abnormalCPUsPerTaskdisplay and incorrectbjobsoutput - Fixed crashes caused by
sview,cpus_per_taskpersistence,gres_detail_str, and related issues
- Fixed a problem where, if a user submitted a job without using
- wrapper:
- Added support for dynamic license accounting
- Enhanced
fslsprocwith tree display and conflict detection - Added
btopandbbotcommands to change job order - Added
bpeekto view the stdout/stderr of running batch jobs - Added
fsopt, supportingbsub,qsub,sbatch, andsrun; supports both interactive and batch commands - Enhanced
lshoststo support the-l,-T,-a, and-Roptions, with filtering by host or cluster - Added support for the
fsquotacommand to display resource quotas and limits- Displays accounting association limits and QoS policies
- Displays current resource usage for jobs, CPU, memory, nodes, and GPU
- Supports filtering by user, account, and QoS
- Provides JSON output for programmatic access
- Added a new
cli_filteradapter compatible with LSF and SGE- Supports the
bsub,qsub,qsh, andqrshcommands - Adds custom fields to distinguish wrapper jobs from native SLURM commands
- Adds comprehensive documentation including user guide, design, and custom fields
- Supports the
- Added the
-jsonoption to support JSON output- Added 6 custom output fields:
account,requeue,tmp_disk,min_nodes,max_nodes, andntasks_per_node - Expanded custom field support to a total of 86 field names, including 71 standard fields and 15 aliases
- Improved field formatting and compatibility with LSF
- Refactored the internal implementation to improve maintainability
- Added 6 custom output fields:
- Enhanced the
statesvcserviceListJobsAPI with a force-refresh option bsub- Added the
-envoption, supporting the full LSF syntax:all,none,selective,exclusion, andassignment - Added support for
-H(suspend job),-Ne(exit notification), and-ti(orphan process termination) - Added
ulimitsupport, including-M,-C,-c,-D,-F,-S,-v,-p,-T, and-ul - Added support for
fsiod, a native x11/stdio forwarding system. Experimental. Advantages: small footprint, fully asynchronous behavior, LSF-compatible behavior, and about a 10% performance improvement forsrun -x11 bsub -wsupports usingJOBNAMEas the job condition, allowing scripts to query job state directly byJOBNAME- Supports the
done(job_name),ended(job_name),exit(job_name), andstarted(job_name)syntax
- Supports the
- Added the
- Fixed issues:
- Fixed the lack of server-side user filtering in
qacct, where each request fetched the full dataset and wasted bandwidth and memory - Fixed an issue where
qacctwithout a specified job ID only fetched data and did not print results - Fixed pagination when
qacctqueried server data - Fixed an issue where
fsjobsdisplayed only the current user's jobs by default
- Fixed the lack of server-side user filtering in
Fsched 10.96
Updated: 2025.09.25
-
slurm:
fsched ping: added checks for jobs in the pending statefsched list jobsAPI: added filter conditions forcomment,wckey,group_id, andnode_name- Removed erroneous logs for the
CgroupAutomountconfiguration option - Added the
job_submit/intelliparamsjob submission plugin - Changed
CR_LLN "load"to use a ratio rather than available CPU count - Added the
FairshareUsedfactor, calculated from resources already consumed - Added the
--extoption tosshareto include theFairshareUsedfield - Added the
--extoption tosprioto include theFairshareUsedfield - Fixed issues:
- Removed proactive loading while loading job information to avoid
slurmdhangs - Used the generic
_get_avail_mapfor batch job binding to fix node ordering during terminate job requests - Used connection, send, and receive timeout settings when fetching job details to fix
statesvchangs - Removed the
_accesscheck to fix permission denied errors forprologandepilogtasks when usingroot_squash - Stopped sorting node names in
_job_testto fix CPU binding issues - Fixed a race condition that caused
slurmctldto crash during automatic scaling - Adjusted the log level for task cgroup errors
- Removed proactive loading while loading job information to avoid
-
wrapper:
- Added the
bswitchcommand to switch pending jobs to another queue - Added the
bstopcommand to stop running jobs - Added the
bresumecommand to resume stopped jobs - Added the
bhistcommand to display job history - Added the
lsinfocommand bhosts: added the-a,-aff,-alloc,-e,-x,-X, and-Roptions; added filtering bycluster_name; fixed status display for the-land-moptionslsload: added the-I,-w,-l,-N,-E,-R, and-aoptions, and added filtering by host or clusterstatesvc: added expanded node lists (expanded nodelists)- Added support for mapping
bsub -Gto the Slurm account - Added support for
fscgdeton both cgroup v1 and v2 statesvc: added job extra information forintelliparamsbjobs:- Switched to using the Fsched API with server-side filtering to load job information
- Added the
start timeandfinish timefields
bqueues:- Added the
-m cluster_nameoption - Added
loadSched/loadStopinformation to the-loutput - Added the
-allocoption - Added the
-u user,alloption - Added
JL/UandJL/Houtput
- Added the
- Fixed issues:
- Fixed an issue where
bsub -Idid not correctly forward command arguments, and fixed the permission issue withbsub -Ep bjobs: fixed-Aand-UF, fixed display byarray_job_idlist, fixed memory usage display, and added job descriptions, scheduling parameters, and resource requirement details to the-loutputbqueues:- Fixed scheduling parameter display in the
-loutput - Fixed multi-task jobs
- Fixed
-m all - Fixed error messages and error codes when a partition or host could not be found
- Fixed
Usersin the-loutput
- Fixed scheduling parameter display in the
- Fixed an issue where
- Added the
Fsched 10.77
Updated: 2025.03.14
- Added support for systems using cgroup v2
- Added
LoadStopandLoadSchedparameter settings based on CPU load - Added support for
bjobs -lto display task information submitted by other users - Fixed a number of known issues
Fsched 10.62
Updated: 2024.12.13
- Added support for configuring multiple partition administrators, authorizing them to cancel any job in the partition and control whether the partition accepts jobs (enable/stop)
- Added support for setting the maximum available CPU count at the partition level
- Added QoS policy support for setting the maximum resource minutes that all running jobs can use for each account or user; when a job exceeds the configured limit, it remains pending
- Allowed updating job memory, requiring the
select/cons_tres_explugin when the job is running or pending - Allowed updating job CPU allocation, only for single-node jobs and requiring the
select/cons_tres_explugin - Added support for querying job usage information, node load information, and user usage information for completed jobs
- Added parsing for parts of the
qsubandsqtatcommand parameters in the SGE wrapper - Fixed a number of known issues
Fsched 10.37
Updated: 2024.09.15
- Allowed users to increase the time limit of already submitted jobs
- Allowed configuring a partition-level option to kill jobs that exceed their requested memory
- Added
loadStopandloadSchedsettings based on CPU and memory utilization - Avoided potential job failures during scheduling when the authentication system, such as LDAP or NIS, becomes unavailable
- Added statistics such as resource usage for running jobs
- Improved response speed when canceling
srunjobs - Added node load and job load collection mechanisms, and used them to improve the output of the
lsloadandbjobscommands in the LSF wrapper - Improved the failover mechanism in HA scenarios to shorten switchover time
- Improved stability under high load