2007

Improving Transparency: Extracting, Visualizing, and Analyzing Corporate Relationships from SEC 10-K Documents

G. Lucas, M. Gebbie, K. Norlen and J. Chuang, "Improving Transparency: Extracting, Visualizing, and Analyzing Corporate Relationships from SEC 10-K Documents". International Journal of Technology, Policy and Management, Vol. 7, No. 1, pp. 15-31, 2007.

Abstract

We present a system to extract, visualise and analyse inter-corporation relationships disclosed by public companies in their annual reports to the US Securities and Exchange Commission (SEC). In improving the transparency of these disclosures, we allow policymakers, analysts, investors and the general public to analyse these relationships at both the firm level and the industry level. Using probabilistic information retrieval and extraction techniques, we automatically extract a dataset of 45,000 relationships between 26,000 companies from over 15 GB of SEC 10-K documents. These relationships range from ownerships, agreements and personal connections to competition and legal disagreements. Information visualisation and social network analytic techniques can then be applied to explore and analyse the dataset.

Last updated: September 20, 2016