Spaces:

rt4u
/

marker

Sleeping

Vik Paruchuri commited on Feb 24

Commit

478b8f8

1 Parent(s): b06e51f

Bump version

Files changed (3) hide show

marker/processors/debug.py CHANGED Viewed

@@ -93,6 +93,9 @@ class DebugProcessor(BaseProcessor):
             line_bboxes = []
             line_text = []
             for child in page.children:
                 if child.block_type != BlockTypes.Line:
                     continue

             line_bboxes = []
             line_text = []
             for child in page.children:
+                if child.removed:
+                    continue
                 if child.block_type != BlockTypes.Line:
                     continue

marker/processors/llm/llm_image_description.py CHANGED Viewed

@@ -23,7 +23,7 @@ You will receive an image of a picture or figure.  Your job will be to create a
 **Instructions:**
 1. Carefully examine the provided image.
 2. Analyze any text that was extracted from within the image.
-3. Output a 3-4 sentence description of the image.  Make sure there is enough specific detail to accurately describe the image.  If there are numbers included, try to be specific.
 **Example:**
 Input:
 ```text

 **Instructions:**
 1. Carefully examine the provided image.
 2. Analyze any text that was extracted from within the image.
+3. Output a faithful description of the image.  Make sure there is enough specific detail to accurately reconstruct the image.  If the image is a figure or contains numeric data, include the numeric data in the output.
 **Example:**
 Input:
 ```text

pyproject.toml CHANGED Viewed

@@ -1,7 +1,7 @@
 [tool.poetry]
 name = "marker-pdf"
-version = "1.5.6"
-description = "Convert PDF to markdown with high speed and accuracy."
 authors = ["Vik Paruchuri <github@vikas.sh>"]
 readme = "README.md"
 license = "GPL-3.0-or-later"

 [tool.poetry]
 name = "marker-pdf"
+version = "1.6.0"
+description = "Convert documents to markdown with high speed and accuracy."
 authors = ["Vik Paruchuri <github@vikas.sh>"]
 readme = "README.md"
 license = "GPL-3.0-or-later"