Skip to content

Commit 79cee9d

Browse files
manav-afacebook-github-bot
authored andcommitted
Temporary Commit at 8/23/2024, 4:23:21 PM
Differential Revision: D61746435
1 parent bfce4bd commit 79cee9d

File tree

1 file changed

+1
-8
lines changed

1 file changed

+1
-8
lines changed

torchx/specs/api.py

Lines changed: 1 addition & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -237,17 +237,12 @@ class RetryPolicy(str, Enum):
237237
application to deal with failed replica departures and
238238
replacement replica admittance.
239239
2. APPLICATION: Restarts the entire application.
240-
3. HOT_SPARE: Restarts the replicas for a role as long as quorum (min_replicas)
241-
is not violated using extra hosts as spares. It does not really support
242-
elasticity and just uses the delta between num_replicas and min_replicas
243-
as spares (EXPERIMENTAL).
244-
4. ROLE: Restarts the role when any error occurs in that role. This does not
240+
3. ROLE: Restarts the role when any error occurs in that role. This does not
245241
restart the whole job.
246242
"""
247243

248244
REPLICA = "REPLICA"
249245
APPLICATION = "APPLICATION"
250-
HOT_SPARE = "HOT_SPARE"
251246
ROLE = "ROLE"
252247

253248

@@ -347,8 +342,6 @@ class Role:
347342
and num_replicas depending on the cluster resources and
348343
policies. If the scheduler doesn't support auto scaling this
349344
field is ignored and the job size will be num_replicas.
350-
EXPERIMENTAL: For HOT_SPARE restart policy this field is used to
351-
indicate the quorum required for the job to run.
352345
max_retries: max number of retries before giving up
353346
retry_policy: retry behavior upon replica failures
354347
resource: Resource requirement for the role. The role should be scheduled

0 commit comments

Comments
 (0)