The example of how to get retrieval metrics along with answer inference based on the context. "ctx" refers to 'context' "ans" refers to 'answer' "gt" refers to 'ground truth answer' "ctx_ans_inference ...
We aim to evaluate Large Language Models (LLMs) for embodied decision-making. While many works leverage LLMs for decision-making in embodied environments, a systematic understanding of their ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results