Hello!
At this point in the merging procedure,
sdweb-merge-block-weighted-gui/scripts/mbw/merge_block_weighted.py
Lines 141 to 162 in 8a62a75
shouldn't it be theta1 instead of theta0? For example:
    for key in tqdm(theta_1.keys(), desc="Stage 2/2"):
        if "model" in key and key not in theta_1:
            if KEY_POSITION_IDS in key:
                if skip_position_ids == 1:
                    print(f" modelB: skip 'position_ids' : {theta_1[KEY_POSITION_IDS].dtype}")
                    dprint(f"{theta_1[KEY_POSITION_IDS]}", verbose)
                    continue
                elif skip_position_ids == 2:
                    theta_1[key] = torch.tensor([list(range(77))], dtype=torch.int64)
                    print(f" modelB: reset 'position_ids': {theta_1[KEY_POSITION_IDS].dtype}")
                    dprint(f"{theta_1[KEY_POSITION_IDS]}", verbose)
                    continue
                else:
                    print(f" modelB: 'position_ids' key found. do nothing : {skip_position_ids}")
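To make the question concrete, here is a minimal sketch that uses toy dictionaries in place of the real checkpoints. The key names, the values, and the helper `keys_only_in_b` are hypothetical and not taken from merge_block_weighted.py. It only shows how the membership test in the loop behaves depending on which dictionary it checks.

```python
# Toy stand-ins for the two state dicts. Keys and values are hypothetical;
# plain lists are used instead of tensors so the sketch needs no imports.
KEY_POSITION_IDS = "cond_stage_model.position_ids"  # assumed name, for illustration only

theta_0 = {"model.diffusion_model.a": [1.0]}
theta_1 = {
    "model.diffusion_model.a": [2.0],
    "model.diffusion_model.b": [3.0],
    KEY_POSITION_IDS: list(range(77)),
}


def keys_only_in_b(theta_a, theta_b):
    """Return keys of theta_b that contain "model" and are missing from theta_a."""
    return [k for k in theta_b if "model" in k and k not in theta_a]


# Membership test against the other dictionary (model A).
missing_vs_theta_0 = keys_only_in_b(theta_0, theta_1)

# Membership test against the same dictionary that is being iterated.
missing_vs_theta_1 = keys_only_in_b(theta_1, theta_1)

print(missing_vs_theta_0)
print(missing_vs_theta_1)
```

In this sketch, testing against `theta_0` selects the keys that exist only in model B, while testing against the dictionary being iterated selects nothing, so the `position_ids` branch would never be reached.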