Abstract

Causal discovery is a fundamental problem in scientific research. Although many researchers have worked on inferring causal relationships from observational data, large-scale causal discovery remains a major challenge. In this paper, a new approach to large-scale causal discovery is proposed, based on a split-and-merge strategy. The method first splits a given dataset into small subdatasets using a graph-partitioning method and then infers the causal structure of each subdataset with an effective algorithm developed for this purpose. The causal structure of the entire dataset is then obtained by combining the causal structures inferred for the subdatasets. Experimental results show that the proposed approach is effective and scalable for large-scale causal discovery problems.