S. Abiteboul, O. Benjelloun, and T. Milo, The Active XML project: an overview, The VLDB Journal, vol.17, issue.5, 2008.

S. Abiteboul, P. Bourhis, A. Galland, and B. Marinoiu, The AXML Artifact Model, 2009 16th International Symposium on Temporal Representation and Reasoning, 2009.
DOI : 10.1109/TIME.2009.9

URL : https://hal.archives-ouvertes.fr/inria-00447694

S. Abiteboul, T. H. Chan, E. Kharlamov, W. Nutt, and P. Senellart, Aggregate queries for discrete and continuous probabilistic XML, Proceedings of the 13th International Conference on Database Theory, ICDT '10, 2010.
DOI : 10.1145/1804669.1804679

URL : https://hal.archives-ouvertes.fr/inria-00537632

S. Abiteboul, B. Kimelfeld, Y. Sagiv, and P. Senellart, On the expressiveness of probabilistic XML models, The VLDB Journal, vol.18, issue.5, 2009.
DOI : 10.1007/s00778-009-0146-1

URL : https://hal.archives-ouvertes.fr/inria-00429498

T. Antonopoulos, F. Geerts, W. Martens, and F. Neven, Generating, sampling and counting subclasses of regular tree languages, ICDT, 2011.

D. Barbosa, A. O. Mendelzon, J. Keenleyside, and K. A. Lyons, ToXgene, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, SIGMOD '02, 2002.
DOI : 10.1145/564691.564769

G. J. Bex, W. Gelade, F. Neven, and S. Vansummeren, Learning deterministic regular expressions for the inference of schemas from XML data, WWW, 2008.

G. J. Bex, F. Neven, T. Schwentick, and K. Tuyls, Inference of concise DTDs from XML data, VLDB, 2006.

G. J. Bex, F. Neven, and S. Vansummeren, Inferring XML schema definitions from XML data, VLDB, 2007.

C. M. Bishop, Pattern Recognition and Machine Learning, 2006.

Z. Chi and S. Geman, Estimation of probabilistic context-free grammars, Comput. Linguist., vol.24, issue.2, 1998.

S. Cohen, Generating XML structure using examples and constraints, Proceedings of the VLDB Endowment, vol.1, issue.1, 2008.
DOI : 10.14778/1453856.1453910

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

S. Cohen, B. Kimelfeld, and Y. Sagiv, Incorporating constraints in probabilistic XML, PODS, 2008.

C. David, L. Libkin, and T. Tan, Efficient reasoning about data trees via integer linear programming, ICDT, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00835833

K. Etessami and M. Yannakakis, Recursive Markov chains, stochastic grammars, and monotone systems of nonlinear equations, JACM, vol.56, issue.1, 2009.

W. Fan and L. Libkin, On XML integrity constraints in the presence of DTDs, JACM, vol.49, issue.3, 2002.

D. Freedman, Markov Chains, 1983.
DOI : 10.1007/978-1-4612-5500-0

M. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim, XTRACT: a system for extracting document type descriptors from XML documents, SIGMOD, 2000.

W. Gelade, T. Idziaszek, W. Martens, and F. Neven, Simplifying XML schema, Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, PODS '10, 2010.
DOI : 10.1145/1807085.1807118

G. Grahne and J. Zhu, Discovering approximate keys in XML data, Proceedings of the eleventh international conference on Information and knowledge management, CIKM '02, 2002.
DOI : 10.1145/584792.584867

R. Kosala, H. Blockeel, M. Bruynooghe, and J. Van den Bussche, Information extraction from structured documents using k-testable tree automaton inference, Data & Knowledge Engineering, vol.58, issue.2, 2006.
DOI : 10.1016/j.datak.2005.05.002

K. Lari and S. J. Young, The estimation of stochastic context-free grammars using the inside-outside algorithm, Computer Speech and Language, 1990.

W. Martens, F. Neven, and T. Schwentick, Simple off the shelf abstractions for XML schema, SIGMOD Record, pp.15-22, 2007.

W. Martens, F. Neven, T. Schwentick, and G. J. Bex, Expressiveness and complexity of XML Schema, ACM Transactions on Database Systems, vol.31, issue.3, pp.770-813, 2006.
DOI : 10.1145/1166074.1166076

W. Martens and J. Niehren, On the minimization of XML Schemas and tree automata for unranked trees, Journal of Computer and System Sciences, vol.73, issue.4, 2007.
DOI : 10.1016/j.jcss.2006.10.021

URL : https://hal.archives-ouvertes.fr/inria-00088406

T. Milo and D. Suciu, Type inference for queries on semistructured data, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, PODS '99, 1999.
DOI : 10.1145/303976.303998

M. Murata, D. Lee, M. Mani, and K. Kawaguchi, Taxonomy of XML schema languages using formal language theory, ACM Transactions on Internet Technology, vol.5, issue.4, 2005.
DOI : 10.1145/1111627.1111631

S. Nestorov, S. Abiteboul, and R. Motwani, Extracting schema from semistructured data, SIGMOD, 1998.