EvaluationTracker.save() will fail at dataset = Dataset.from_list([asdict(detail) for detail in task_details]) with
Exception has occurred: ArrowInvalid (note: full exception trace is shown but execution is paused at: _run_module_as_main)
cannot mix list and non-list, non-null values
if the launched task uses both generative and logprob metrics (possibly because, for example, the two kinds of metrics do not save lists of the same length).
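To help narrow this down, here is a minimal stdlib-only sketch for finding which fields trigger the error. It assumes the message means that some column holds a list in some rows and a plain value in others, which is what the wording suggests. FakeDetail, mixed_list_columns and wrap_scalars are hypothetical names made up for illustration, not part of the library, and the check only looks at top-level fields.

```python
from dataclasses import asdict, dataclass, field
from typing import Any


@dataclass
class FakeDetail:
    # Hypothetical stand-in for the real per-sample detail dataclass.
    example: str = ""
    predictions: Any = None
    metrics: dict = field(default_factory=dict)


def mixed_list_columns(rows):
    """Names of top-level columns that hold a list in some rows and a
    non-list, non-None value in others."""
    kinds = {}
    for row in rows:
        for key, value in row.items():
            if value is not None:
                kinds.setdefault(key, set()).add(isinstance(value, list))
    return sorted(key for key, seen in kinds.items() if len(seen) == 2)


def wrap_scalars(rows, columns):
    """Copy rows, wrapping non-list, non-None values of the given columns in a list."""
    fixed = []
    for row in rows:
        new_row = dict(row)
        for key in columns:
            value = new_row.get(key)
            if value is not None and not isinstance(value, list):
                new_row[key] = [value]
        fixed.append(new_row)
    return fixed


# Assumed shapes: one detail stores a list, the other a bare float.
task_details = [
    FakeDetail(example="q1", predictions=["a generated answer"]),
    FakeDetail(example="q2", predictions=-1.23),
]
rows = [asdict(detail) for detail in task_details]
bad = mixed_list_columns(rows)
print(bad)  # ['predictions']
rows = wrap_scalars(rows, bad)
print(mixed_list_columns(rows))  # []
```

Running mixed_list_columns on the real [asdict(detail) for detail in task_details] just before the Dataset.from_list call should show which fields differ between the two metric types. Wrapping scalars is only one possible workaround; whether it is the right fix depends on how the details are read back later.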