Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RL Unplugged] - Trained policies for finger_turn_hard do not match the datasets #465

Open
JenAlchimowicz opened this issue Oct 9, 2023 · 0 comments

Comments

@JenAlchimowicz
Copy link

JenAlchimowicz commented Oct 9, 2023

Hi, thanks for the good work.

I am running custom offline evaluation on the tasks from DeepMind Control Suite Dataset. I am taking the data from Tensorflow Datasets here and downloading the maching policies from the GCP bucket here.

Most tasks work fine, but when I get to finger_turn_hard I get the following error while inferring the policies on the data:
image

It looks like the policy is expecting one of the keys to be "touch", however, this key is not in the observations. According to the documentation here, there is no dimension "touch" in this dataset.

For context, I am loading the policy using:

tf.saved_model.load(policy_path)

And inferring using:

if hasattr(policy, 'initial_state'):
    action = policy(observation, ((),))[0]
else:
    action = policy(observation)

Am I missing something here? How can I infer the finger_turn_hard policies on the provided datasets?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant