xid creation in dataset #9

aburns4 · 2021-02-22T20:50:31Z

Hi,

I was wondering if you could explain how you obtained the 'xid' values for the annotations in the dataset. Did you perform breadth or depth first search and number elements in the DOM according to the traversal? Were there any other specifications to count the elements, such as whether they were visible or not?

Thank you!

ppasupat · 2021-02-23T00:57:25Z

Hello. The xids were assigned in the order in which each element's opening tag appears in the document (which is equivalent to a pre-order depth-first traversal). All tags, including invisible ones, get an xid.

The dataset, which was processed with BeautifulSoup, uses the following:

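# i is a counter initialized before this loop (its starting value is not shown here)
for x in soup.body(True):     # Select all tag nodes under <body>, in document order
    x['data-xid'] = i
    i += 1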

The demo Chrome extension also does something similar in the injectXids function.
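To make the numbering order concrete, here is a minimal sketch that uses only the Python standard library rather than BeautifulSoup. It is not the code that built the dataset. The class name `XidAssigner`, the helper `assign_xids`, and the starting value of 0 are all assumptions made for illustration. It numbers every element under `<body>` at the moment its opening tag is seen, which gives the same order as a pre-order depth-first traversal, and it does not check visibility.

```python
from html.parser import HTMLParser


class XidAssigner(HTMLParser):
    """Number elements under <body> in the order their opening tags appear."""

    def __init__(self, start=0):
        super().__init__()
        self.next_xid = start   # assumed starting value, not confirmed by the thread
        self.in_body = False
        self.assigned = []      # list of (xid, tag name) pairs

    def handle_starttag(self, tag, attrs):
        if tag == "body":
            # <body> itself is not numbered here, only the elements inside it.
            self.in_body = True
            return
        if self.in_body:
            # No visibility check: hidden elements get an xid too.
            self.assigned.append((self.next_xid, tag))
            self.next_xid += 1

    def handle_endtag(self, tag):
        if tag == "body":
            self.in_body = False


def assign_xids(html, start=0):
    """Return (xid, tag) pairs for a document, in opening-tag order."""
    parser = XidAssigner(start)
    parser.feed(html)
    parser.close()
    return parser.assigned


page = (
    "<html><head><title>t</title></head><body>"
    "<div><p>Hello</p><span style='display:none'>x</span></div>"
    "<a href='#'>link</a>"
    "</body></html>"
)
for xid, tag in assign_xids(page):
    print(xid, tag)
```

In this sketch the nested `p` and the hidden `span` are numbered before the later sibling `a`, because their opening tags come first in the source. Only opening tags trigger numbering, so the text "Hello" never receives a number of its own.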

aburns4 · 2021-02-28T14:35:12Z

Hi, okay. What about text elements that do not have children? They don't seem to have xids in the data files.

Thank you again.
