Skip to content

Cloudml_train and job_collect #210

@philipus

Description

@philipus

i have a problem by applying mnist_mlp.R (https://github.com/rstudio/keras/blob/master/vignettes/examples/mnist_mlp.R) using cloudml_train on google cloud platform.

Even the job on google ai platforms run properly the job does not finish automatically. Also or because of that the job_collect functionality does not copy any files into local directory (runs)... when I cancel the job manually on google ai platform I see the the new job folder of the corresponding job.

So... why the hack the job runs for ever on google ai platform?!

I think the download functionality does not work properly. I also do not have a local runs directory created as it does in the mnist_mlp.R script. I think job_collect is the problem

cloudml::job_collect('Project Name', destination = '../runs', view = 'save')

does not copy anything in the destination folder

Any Idea what we can do?

R commands:

library(cloudml)
cloudml_train("mnist_mlp.R", config = "config.yml")

config.yml:

trainingInput:
scaleTier: BASIC
runtimeVersion: "2.1"
pythonVersion: "3.7"

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions