HAL Id: hal-01432986
https://hal.univ-lille.fr/hal-01432986
Submitted on 12 Jan 2017
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
TEL
Agnès Magron
To cite this version:
Agnès Magron. TEL : an open access corpus available for research. 19th International Symposium on Electronic Theses and Dissertations (ETD 2016): ”Data and Dissertations” , Jul 2016, Villeneuve d’Ascq, France. �hal-01432986�
Engineering sciences
Physics
Computer science Humanities and
social sciences Life sciences
Sciences of the universe
Mathematics
Chemical sciences
an open access corpus available for research
TEL
https://tel.archives-ouvertes.frETD 2016 Data and dissertations : 19th International Symposium on Electronic Theses and Dissertations 11-13th July 2016 Lille (France)
Auteur : Agnès Magron (CCSD)
Open repository for dissertations
Glossary
ABES : Agence Bibliographique de l’Enseignement Supérieur STAR : Signalement des Thèses électroniques et ARchivage
TEL (Theses-en-Ligne) is a french open repository created in 2001 and dedicated to theses self-archiving.
With the partnership with ABES through its STAR application, it became a national repository used by institutions for the diffusion of electronic theses.
+55 000
fulltext theses
Re-use of the data in the later publications ?
OAI-PMH and API
TEL can be harvested via OAI-PMH protocol
Repository Name HAL
Base URL https://api.archives-ouvertes.fr/oai/tel/
Protocol Version 2.0
Earliest Datestamp 23/09/2002 Deleted Record no
Granularity YYY-MM-DD
Admin Email contact@archives-ouvertes.fr
Metadata describing the theses (author, universities, jury members…) form a corpus that can be used for any study on doctoral research in France.
API allow to search through them with a high level of accuracy. They can be used for forming a corpus, producing statistics, cross-check data, and compare them to other corpus data.
https://api.archives-ouvertes.fr/docs
year nb theses
after 2010 31319
2000-2009 19025
1990-1999 2575
1980-1989 1181
1970-1979 722
1900-1970 350
before 1900 7
The oldest dates back to 1883
Languages French : 80,8 % English : 18,7 %
Others : portuguese, spanish, italian, ...
agnes.magron@ccsd.cnrs.fr
Since TEL is a part of the open repository HAL, the data can also be observed in the author's other publications, if they've been submitted to HAL
32 % of theses are self-archived.
Multidisciplinary