Publisher’s version / Version de l'éditeur:
Skygazing: Astronomy through the seasons, 2018-10-02
READ THESE TERMS AND CONDITIONS CAREFULLY BEFORE USING THIS WEBSITE. https://nrc-publications.canada.ca/eng/copyright
Vous avez des questions? Nous pouvons vous aider. Pour communiquer directement avec un auteur, consultez la
première page de la revue dans laquelle son article a été publié afin de trouver ses coordonnées. Si vous n’arrivez pas à les repérer, communiquez avec nous à PublicationsArchive-ArchivesPublications@nrc-cnrc.gc.ca.
Questions? Contact the NRC Publications Archive team at
PublicationsArchive-ArchivesPublications@nrc-cnrc.gc.ca. If you wish to email the authors directly, please see the first page of the publication for their contact information.
NRC Publications Archive
Archives des publications du CNRC
This publication could be one of several versions: author’s original, accepted manuscript or the publisher’s version. / La version de cette publication peut être l’une des suivantes : la version prépublication de l’auteur, la version acceptée du manuscrit ou la version de l’éditeur.
For the publisher’s version, please access the DOI link below./ Pour consulter la version de l’éditeur, utilisez le lien DOI ci-dessous.
https://doi.org/10.4224/23004522
Access and use of this website and the material on it are subject to the Terms and Conditions set forth at
Astronomy and Huge Data
Tapping, Ken
https://publications-cnrc.canada.ca/fra/droits
L’accès à ce site Web et l’utilisation de son contenu sont assujettis aux conditions présentées dans le site LISEZ CES CONDITIONS ATTENTIVEMENT AVANT D’UTILISER CE SITE WEB.
NRC Publications Record / Notice d'Archives des publications de CNRC:
https://nrc-publications.canada.ca/eng/view/object/?id=99472788-f83f-4c67-aefe-d7ed2ad0f53e https://publications-cnrc.canada.ca/fra/voir/objet/?id=99472788-f83f-4c67-aefe-d7ed2ad0f53e
ASTRONOMY AND HUGE DATA
Ken Tapping, 2ndOctober, 2018We now live in an age of Big Data. Once we developed the technologies for handling and storing huge amounts of information, we went on to collect more and more of it. In the same way, astronomy is now in the age of “Huge Data”. Not very long ago making astronomical
observations consisted of setting up the telescope and instruments attached to it, pointing it at the object of interest and then manually recording the data. When computers first moved into astronomy, they were used to automate the operation of the telescope and to record data. We took the results away and used computers to analyze it. Then, as computers got smaller, faster and cheaper the game changed. With computer help our telescopes could record more data about more things, faster. We can now carry out and process large-scale surveys of the sky, and keep an eye open for transient events. We can make networks of many radio telescopes distributed over thousands of kilometres, processing their outputs digitally to emulate one huge radio telescope. Multitudes of small, high-speed computers now form parts of our instruments, no longer just controlling them. The result is a tsunami of data we have to store, make accessible, and somehow to analyze.
One other issue we needed to address is the enormous amount of astronomical information that has accumulated from past observations. Some came from large-scale surveys made at some observatories, and stored there. In addition, sitting in astronomers’ offices around the world was data from observations they had made in the past. This led to two serious problems. Firstly, astronomers would propose new observations not knowing that someone else had already made those
observations. Secondly, with the rapid evolution of data storage technology, stored data might have become unreadable because nobody has the devices to read it. For example, who these days has the means to read a floppy disc? The solution is to put all the data in special-purpose data
centres, where it is archived, backed up and provided in a form that astronomers and other researchers can access as and when they need it. Our national system is called the Canadian
Astronomy Data Centre – the CADC.
We have all heard of something out there in our digital world called “the cloud”. This rather mystical name refers to a number of huge “server farms”: data storage places that hold, archive and generally look after your data and software, and provide additional tools you might need for accessing and working with it. The CADC and other astronomical data centres form a “cloud” for the scientific community. The huge amounts of data coming out of the latest astronomical instruments and our desire to make that data as broadly accessible as possible forces us in that direction. However, having all this data available poses another serious problem. How can we search an enormous number of files and databases for the information we need?
We’ve all used “search engines” to find information on the Internet. These devices use forms of
artificial intelligence: computer programs that emulate certain aspects of the way we search for and assimilate information. In a similar way, we use software assistance to search out what we need from our rapidly growing pile of data we are accumulating about the universe we live in. However, it will be a while before we completely eliminate the need to dig around in the data ourselves, because it is very difficult to program in all the questions we might possibly ask, and research essentially involves asking questions that have never been asked before.
Mars is still conspicuous in the southern sky. Saturn lies low in the south and Jupiter very low in the southwest after sunset. The Moon will reach Last Quarter on the 2ndand be New on the 8th.
Ken Tapping is an astronomer with the National Research Council's Dominion Radio Astrophysical Observatory, Penticton, BC, V2A 6J9.
Tel (250) 497-2300, Fax (250) 497-2355 E-mail: ken.tapping@nrc-cnrc.gc.ca