Skip to content

Commit

Permalink
feat: disable auto include table title
Browse files Browse the repository at this point in the history
  • Loading branch information
许瑞 authored and 许瑞 committed Mar 26, 2024
1 parent f0c463e commit cb1b02e
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion magic_pdf/pdf_parse_for_train.py
Original file line number Diff line number Diff line change
Expand Up @@ -220,7 +220,7 @@ def parse_pdf_for_train(
# 解析表格并对table_bboxes进行位置的微调,防止表格周围的文字被截断
table_bboxes = parse_tables(page_id, page, model_output_json)
table_bboxes = fix_tables(
page, table_bboxes, include_table_title=True, scan_line_num=2
page, table_bboxes, include_table_title=False, scan_line_num=2
) # 修正
table_bboxes = fix_table_text_block(
text_raw_blocks, table_bboxes
Expand Down

0 comments on commit cb1b02e

Please sign in to comment.