You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
But I found that the benchmark prediction time is about twice the actual model training time.
I think the problem lies in:
According to the function, def extract_averages(file_path,args):, compute_cache calculates the time of the entire attention or mlp process. However in the generated workload, an attention or mlp is decomposed into two parts, such as:
The entire attention time is assigned to MegatronColumnLinear, resulting in this issue:
def Comp_with_aiob(workload, compute_cache):
for item in workload.workload:
if item.comm_type == CommType.computation:
for key in compute_cache:
key_temp = key.split("_")[0]
if key_temp in item.stage:
item._elapsed_time = compute_cache[key]
break
return workload
The text was updated successfully, but these errors were encountered:
During the process of running AICB with customized parameters, my command is as follows:
But I found that the benchmark prediction time is about twice the actual model training time.
I think the problem lies in:
According to the function,
def extract_averages(file_path,args):
,compute_cache
calculates the time of the entire attention or mlp process. However in the generated workload, an attention or mlp is decomposed into two parts, such as:The entire attention time is assigned to MegatronColumnLinear, resulting in this issue:
The text was updated successfully, but these errors were encountered: