• Aucun résultat trouvé

Astronomy and Huge Data

N/A
N/A
Protected

Academic year: 2021

Partager "Astronomy and Huge Data"

Copied!
2
0
0

Texte intégral

(1)

Publisher’s version / Version de l'éditeur:

Skygazing: Astronomy through the seasons, 2018-10-02

READ THESE TERMS AND CONDITIONS CAREFULLY BEFORE USING THIS WEBSITE. https://nrc-publications.canada.ca/eng/copyright

Vous avez des questions? Nous pouvons vous aider. Pour communiquer directement avec un auteur, consultez la

première page de la revue dans laquelle son article a été publié afin de trouver ses coordonnées. Si vous n’arrivez pas à les repérer, communiquez avec nous à PublicationsArchive-ArchivesPublications@nrc-cnrc.gc.ca.

Questions? Contact the NRC Publications Archive team at

PublicationsArchive-ArchivesPublications@nrc-cnrc.gc.ca. If you wish to email the authors directly, please see the first page of the publication for their contact information.

NRC Publications Archive

Archives des publications du CNRC

This publication could be one of several versions: author’s original, accepted manuscript or the publisher’s version. / La version de cette publication peut être l’une des suivantes : la version prépublication de l’auteur, la version acceptée du manuscrit ou la version de l’éditeur.

For the publisher’s version, please access the DOI link below./ Pour consulter la version de l’éditeur, utilisez le lien DOI ci-dessous.

https://doi.org/10.4224/23004522

Access and use of this website and the material on it are subject to the Terms and Conditions set forth at

Astronomy and Huge Data

Tapping, Ken

https://publications-cnrc.canada.ca/fra/droits

L’accès à ce site Web et l’utilisation de son contenu sont assujettis aux conditions présentées dans le site LISEZ CES CONDITIONS ATTENTIVEMENT AVANT D’UTILISER CE SITE WEB.

NRC Publications Record / Notice d'Archives des publications de CNRC:

https://nrc-publications.canada.ca/eng/view/object/?id=99472788-f83f-4c67-aefe-d7ed2ad0f53e https://publications-cnrc.canada.ca/fra/voir/objet/?id=99472788-f83f-4c67-aefe-d7ed2ad0f53e

(2)

ASTRONOMY AND HUGE DATA

Ken Tapping, 2ndOctober, 2018

We now live in an age of Big Data. Once we developed the technologies for handling and storing huge amounts of information, we went on to collect more and more of it. In the same way, astronomy is now in the age of “Huge Data”. Not very long ago making astronomical

observations consisted of setting up the telescope and instruments attached to it, pointing it at the object of interest and then manually recording the data. When computers first moved into astronomy, they were used to automate the operation of the telescope and to record data. We took the results away and used computers to analyze it. Then, as computers got smaller, faster and cheaper the game changed. With computer help our telescopes could record more data about more things, faster. We can now carry out and process large-scale surveys of the sky, and keep an eye open for transient events. We can make networks of many radio telescopes distributed over thousands of kilometres, processing their outputs digitally to emulate one huge radio telescope. Multitudes of small, high-speed computers now form parts of our instruments, no longer just controlling them. The result is a tsunami of data we have to store, make accessible, and somehow to analyze.

One other issue we needed to address is the enormous amount of astronomical information that has accumulated from past observations. Some came from large-scale surveys made at some observatories, and stored there. In addition, sitting in astronomers’ offices around the world was data from observations they had made in the past. This led to two serious problems. Firstly, astronomers would propose new observations not knowing that someone else had already made those

observations. Secondly, with the rapid evolution of data storage technology, stored data might have become unreadable because nobody has the devices to read it. For example, who these days has the means to read a floppy disc? The solution is to put all the data in special-purpose data

centres, where it is archived, backed up and provided in a form that astronomers and other researchers can access as and when they need it. Our national system is called the Canadian

Astronomy Data Centre – the CADC.

We have all heard of something out there in our digital world called “the cloud”. This rather mystical name refers to a number of huge “server farms”: data storage places that hold, archive and generally look after your data and software, and provide additional tools you might need for accessing and working with it. The CADC and other astronomical data centres form a “cloud” for the scientific community. The huge amounts of data coming out of the latest astronomical instruments and our desire to make that data as broadly accessible as possible forces us in that direction. However, having all this data available poses another serious problem. How can we search an enormous number of files and databases for the information we need?

We’ve all used “search engines” to find information on the Internet. These devices use forms of

artificial intelligence: computer programs that emulate certain aspects of the way we search for and assimilate information. In a similar way, we use software assistance to search out what we need from our rapidly growing pile of data we are accumulating about the universe we live in. However, it will be a while before we completely eliminate the need to dig around in the data ourselves, because it is very difficult to program in all the questions we might possibly ask, and research essentially involves asking questions that have never been asked before.

Mars is still conspicuous in the southern sky. Saturn lies low in the south and Jupiter very low in the southwest after sunset. The Moon will reach Last Quarter on the 2ndand be New on the 8th.

Ken Tapping is an astronomer with the National Research Council's Dominion Radio Astrophysical Observatory, Penticton, BC, V2A 6J9.

Tel (250) 497-2300, Fax (250) 497-2355 E-mail: ken.tapping@nrc-cnrc.gc.ca

Références

Documents relatifs

5.2 Simulated sampling design based CWGQL (DBCWGQL) estimates for the diagnostic and/ or prognost ic covariates in MAR based incomplete longitudinal models for count data

A lot of work has been done in the concerned direction by different authors, and the following are the contribu- tions of this work: (1) An exhaustive study of the existing systems

Примерами такой проверки по вносимым данным могут быть: автоматизированная проверка на то, что такой результат не был внесён в

The Hadoop framework has also been shown to be inefficient within the domain of large- scale graph processing and streaming data [1].. It confirms a similar situation in data

In the case of population-based surveys, further standardization of methodologies, indicators and definitions would enable countries to collect more and comparable data sets on

Hierarchical classifiers considered in the paper combine process mining, extraction of attributes on the basis of temporal patterns and construct- ing classifiers based on

To preserve data and query confidentiality from possible leakage, our solution exploits optimal encoding for genomic variants and combines it with homomorphic encryption and

To assure quality of, and access to, data, many funding agencies, such as the NSF and the DFG (Deutsche Forschungsgemeinschaft, Germany), require proposals to have data management