You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Something I have learned is you can append -index.html to the end of a CIK which will give you a far more parse-able HTML file than the SGML you get as a reference from one of the indices (also I trust that SGML about as far as I can throw it, it contains PDFs and other blobs in it... so it's not far). That said, parsing the SGML would give you the contents of the index with a single download.
@itsclaireh is it possible to get added to this repo and maybe make assignment stuff for this? I would like to explore exposing this data
@michael-watson you should be able to fork the repo, and once this gets off the ground and has some organizational stuff setup, then you could file a pull request!
EDGAR maintains indices of all SEC filings, which you can find documentation for here:
https://www.sec.gov/edgar/searchedgar/accessing-edgar-data.htm
Something I have learned is you can append
-index.html
to the end of a CIK which will give you a far more parse-able HTML file than the SGML you get as a reference from one of the indices (also I trust that SGML about as far as I can throw it, it contains PDFs and other blobs in it... so it's not far). That said, parsing the SGML would give you the contents of the index with a single download.example line from an index:
URL-hacked index:
https://www.sec.gov/Archives/edgar/data/738076/0001005477-00-005922-index.html
disclaimer, I have no idea what this company is, I just grabbed a random line with a 10-K filing
The text was updated successfully, but these errors were encountered: