Instances with requested resources not detected #79

@olgabot

Hello!
I'm running a batch job that requires a lot of memory, so I added x1e.2xlarge (244 GiB memory, 8 vCPUs) to the instance types in my ~/.reflow/config.yaml:

[Mon 1 Oct 10:22] ~/code/tick-genome/pre_assembly_qc/full_workflow (master) $ grep x1e.2xlarge ~/.reflow/config.yaml
  - x1e.2xlarge

But when I run the batch job, Reflow reports that the requested resources are not satisfiable by any available instance type. There's no "allowable instances exceeded" error either, so I don't think it's because someone else in the org is using x1e.2xlarge.

[Mon 1 Oct 10:22] ~/code/tick-genome/pre_assembly_qc/full_workflow (master) $ reflow runbatch -retry
reflow: batch program ../../reflow/pre-assembly.rf runsfile samples.csv
retrying run Undetermined_S0
retrying run tick_1_S1
retrying run tick_2_S2
reflow: run tick_2_S2: error: resources exhausted: requested resources {mem:240.0GiB cpu:8 disk:1.0GiB} not satisfiable by any available instance type
reflow: run Undetermined_S0: error: resources exhausted: requested resources {mem:240.0GiB cpu:8 disk:1.0GiB} not satisfiable by any available instance type
reflow: run tick_1_S1: error: resources exhausted: requested resources {mem:240.0GiB cpu:8 disk:1.0GiB} not satisfiable by any available instance type

EDIT: If I add x1e.4xlarge (488 GiB RAM, 16 vCPUs) to my list of instance types, the job goes through. But why does it need the larger instance when x1e.2xlarge (244 GiB) should be enough for a 240 GiB request?

Do you know what may be happening?
Warmest,
Olga
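
One possible explanation, sketched here under an assumption that nothing in this thread confirms: if the scheduler reserves some fraction of each instance's advertised memory for system overhead, the usable memory on x1e.2xlarge could fall below the 240 GiB request even though the advertised 244 GiB exceeds it. The 5% figure and the helper names below are hypothetical, chosen only to illustrate the arithmetic; they are not taken from Reflow's source or documentation.

```python
# Hypothetical illustration of how a memory reservation could make a
# 240 GiB request unsatisfiable on a 244 GiB instance.
# ASSUMPTION: the reserved fraction (5%) is made up for this example.

GIB = 1024 ** 3


def usable_memory(advertised_gib: float, reserved_fraction: float) -> float:
    """Bytes left for tasks after reserving a fraction for overhead."""
    return advertised_gib * GIB * (1.0 - reserved_fraction)


def fits(requested_gib: float, advertised_gib: float, reserved_fraction: float) -> bool:
    """True if the request fits in the instance's usable memory."""
    return requested_gib * GIB <= usable_memory(advertised_gib, reserved_fraction)


def max_reserved_fraction(requested_gib: float, advertised_gib: float) -> float:
    """Largest reserved fraction at which the request still fits."""
    return 1.0 - requested_gib / advertised_gib


if __name__ == "__main__":
    requested = 240.0  # GiB, from the error message in this issue
    instances = [("x1e.2xlarge", 244.0), ("x1e.4xlarge", 488.0)]
    for name, mem_gib in instances:
        ok = fits(requested, mem_gib, reserved_fraction=0.05)
        headroom = max_reserved_fraction(requested, mem_gib)
        print(f"{name}: fits={ok}, headroom={headroom:.2%}")
```

With these numbers, x1e.2xlarge leaves only about 1.6% of headroom above the request, so any reservation larger than that would rule it out, while x1e.4xlarge has ample room. Whether Reflow actually applies such a reservation is a question for the maintainers.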
