Brought to you by:
Paper

Vialactea Visual Analytics Tool for Star Formation Studies of the Galactic Plane

, , , , , , , , , , , , , , , , and

Published 2018 June 28 © 2018. The Astronomical Society of the Pacific. All rights reserved.
, , Citation F. Vitello et al 2018 PASP 130 084503 DOI 10.1088/1538-3873/aac5d2

1538-3873/130/990/084503

Abstract

We present a visual analytics tool, based on the VisIVO suite, to exploit a combination of all new-generation surveys of the Galactic Plane to study the star formation process of the Milky Way. The tool has been developed within the VIALACTEA project, founded by the 7th Framework Programme of the European Union, that creates a common forum for the major new-generation surveys of the Milky Way Galactic Plane from the near-infrared to the radio, both in thermal continuum and molecular lines. Massive volumes of data are produced by space missions and ground-based facilities and the ability to collect and store them is increasing at a higher pace than the ability to analyze them. This gap leads to new challenges in the analysis pipeline to discover information contained in the data. Visual analytics focuses on handling these massive, heterogeneous, and dynamic volumes of information accessing the data previously processed by data mining algorithms and advanced analysis techniques with highly interactive visual interfaces offering scientists the opportunity for in-depth understanding of massive, noisy, and high-dimensional data.

Export citation and abstract BibTeX RIS

1. Introduction

Currently, Astrophysics data are produced at a very high rate, and the quantity of data collected and stored is increasing at a faster rate than the ability to analyze them. Massive volumes of unstructured data, coming from space missions and ground-based facilities, covering the whole frequency spectrum, form continuous streams of terabytes of data that have to be recorded and analyzed. Future large infrastructures such as the Square Kilometre Array (SKA)7 are estimated to generate an exabyte a day of raw data, which could be compressed to around 10 petabytes.

Modern astronomical surveys provide not only image data but also catalogues of millions of objects (for example stars and galaxies), each of them with tens of associated parameters. The amount and complexity of this data is so high that it far exceeds the ability of humans to evaluate it all. By common data analysis techniques like knowledge discovery, astronomers can find new phenomena, relationships and useful knowledge about the universe. Given that data are affected by noise, a visual analytics approach can help separating relevant data from noise and help identifying unexpected phenomena inside the massive and dynamic data streams.

Astronomy visualization (Hassan & Fluke 2011) encompasses topics such as optical and radio imaging, presentation of simulation results, multi-dimensional exploration of catalogs, and public outreach visuals. It is of fundamental importance in various stages of astronomical research: from the planning phase, through the observing processes or simulation runs, to quality control, qualitative knowledge discovery and quantitative analysis.

Many tools are available to astrophysicists for visualization and data access. For example, TOPCAT8 is an interactive graphical viewer and editor for astronomical tables such as object catalogs, designed to cope well with large tables. It offers a variety of ways to visualize and analyze the data, and extensive 2D and 3D visualization facilities. A number of options are provided for loading data from external sources, including Virtual Observatory (VO) services, thus providing a gateway to many remote archives of astronomical data. It can also interoperate with other desktop tools using VO Simple Application Messaging Protocol (SAMP)9 such as Aladin10 or SAOImage DS9.11 Aladin allows to visualize and manipulate digitized astronomical images or full surveys, superimpose entries from astronomical catalogs or databases, and interactively access related data and information from the Simbad12 database, the VizieR13 service and other archives for all known astronomical objects in the field. SAOImage DS9 is an astronomical imaging and data visualization application. It supports FITS images and binary tables, multiple frame buffers, region manipulation, and advanced features such as 2D, 3D and RGB frame buffers, mosaic images, tiling, blinking, geometric markers, colormap manipulation, scaling, arbitrary zoom, cropping, rotation, pan, and a variety of coordinate systems. Visualization Interface for the Virtual Observatory (VisIVO) (Sciacca et al. 2015), which is the core on which the presented visual analytics environment is based, offers a unique integrated ecosystem for visualization, including services for collaborative portals, mobile applications for visualization and data exploration and a number of key components such as workflow applications, analysis and data mining functionalities.

In recent years, the visual analytics is an emerging area of scientific analysis. It offers an integral approach combining visualization, human factors and data analysis (Keim et al. 2008). It integrates methodology from information analytics, geospatial analytics, and scientific analytics. Especially human factors (e.g., interaction, cognition, perception, collaboration, presentation, and dissemination) play a key role in the communication between human and computer, as well as in the decision-making process (Hilda et al. 2016). Visual analytics furthermore profits from methodologies developed in the fields of data management and knowledge representation, knowledge discovery and statistical analytics.

As an example, encube (Vohl et al. 2017) provides a large scale comparative visual analytics framework tailored for use with large tiled-displays and advanced immersive environments like the CAVE2 (a modern hybrid 2D and 3D virtual reality environment). The framework is designed to harness the power of high-end visualization environments for collaborative analysis of large subsets of data from radio surveys.

This paper presents the ViaLactea Visual Analytic tool (VLVA), an innovative environment for the study of star-forming regions on our Galaxy. It allows an integrated analysis and exploitation of the combination of all new-generation surveys (from infrared to radio) of the Galactic Plane from space missions and ground-based facilities, using a novel data and science analysis paradigm based on 3D visual analytics and data mining frameworks. We will show the main features of VLVA via three fundamental use cases related to i) the analysis of compact sources clumps and Spectral Energy Distributions (SEDs); ii) studies on diffuse bubble-like structures; and iii) filamentary structure analysis.

2. Background

The formation of stars and star clusters is by far the most important event that shapes the evolution and fate of galaxies. Let us take for example massive stars, which are responsible for the global ionization of the Interstellar Medium (ISM) (see e.g., Churchwell 2002 and references therein). Their energetic stellar winds and supernova blast waves direct the dynamical evolution of the ISM, shaping its morphology, energetics and chemistry, and influencing the formation of subsequent generations of stars and planetary systems.

Stars form in Molecular Clouds, where about half of the mass in the ISM is stored. Due to the strong extinction of optical light by dust grains mixed with the gas, the birth of stars must be studied at infrared (IR) and longer wavelengths that can penetrate the clouds (Rieke & Lebofsky 1985; Martin & Whittet 1990; Fitzpatrick & Massa 2009). The paradigm for the formation of solar-type stars via accretion from an envelope through a circumstellar disk predicts an evolution from cores to protostars, and finally pre-main sequence stars that is well matched with distinctive characteristics of their Spectral Energy Distribution (SED).

The empirical classification of the SED of low-mass Young Stellar Object (YSOs) either based on detailed modeling (Robitaille et al. 2006) or on simple color analysis (e.g., Gutermuth et al. 2009), as well as the dust envelope temperature and the evolution of integrated parameters like the bolometric luminosity and the mass of the circumstellar envelope (Molinari et al. 2008) have been used as powerful tools to constrain the evolutionary stage of YSOs. The formation of low-mass stars is however complicated by the fact that they are born for the most part in clusters together with more massive YSO. Higher-mass YSOs reach the conditions for H-burning faster than the time required to assemble them, so that winds and radiative acceleration will strongly influence their late accretion phases (Zinnecker & Yorke 2007) and that of the other cluster members.

In a traditional "slow formation" picture, diffuse ISM is accumulated by large-scale perturbations such as the passage of a spiral arm. Shielding by dust and surface reactions on grains promotes the transition from H i to H ii, which in turn allows the formation of other molecules that cool the cloud. Gravity, mediated by the magnetic fields leads to unstable dense clumps with typical sizes between 0.1 and 1 pc, that can further differentiate into a multiplicity of cores that are typically dubbed protoclusters. Typically each core harbors a forming YSOs eventually leading to a single star or a close binary. Alternative more dynamical scenarios posit that Molecular Clouds are transient, short-lived structures created in the post-shock regions of converging large-scale flows (Hartmann et al. 2001) or in turbulence driven compression (Padoan & Nordlund 2002) that produce layers and filaments where column density builds up shielding material from the Interstellar Radiation field. Cooling and gravitational instabilities will naturally arise (Heitsch et al. 2008), fragmenting the filaments under the effect of self gravity into chains of dense supercritical turbulent clumps that harbour massive protostars and protoclusters. Indeed, one of the most important and unexpected discoveries of the Herschel Galactic photometric surveys (Molinari et al. 2010a) is the Galaxy-wide ubiquity of the filamentary nature of the cold, dense and potentially star-forming phase of the ISM. Star formation could also be triggered in clumps located at the interfaces between dense clouds and expanding H ii regions or supernova remnants (Deharveng & Zavagno 2008).

The fact that the mass function in protoclusters has a similar shape to the Initial Mass Function in field stars suggests that the end-products of star formation are entirely defined as early as the clump formation stage. Therefore the detailed morphology of dense star-forming structures, their mass and temperature distribution, and the timescales for their formation and for star formation inside them, hold the key to identify the dominant mechanism responsible for their formation.

The EU FP7 VIALACTEA project14 aims to exploit the combination of all new-generation surveys of the Galactic Plane to build and deliver a galaxy scale predictive model for star formation of the Milky Way. Usually the essential steps necessary to unveil the inner workings of the galaxy as a star formation engine are often carried out manually, and necessarily over a limited number of galactic sources or very restricted regions. Therefore scientists required new technological solutions able to deal with the growing data size and quantity coming from the available surveys to extract the meaningful informations contained in the data. This resulted in an innovative framework (Becciani et al. 2015) based on advanced visual analytics techniques (presented in this paper) accessing a knowledge base based on Virtual Observatory (VO) data representation and retrieval standards that is enriched thanks to data mining and machine learning methodologies.

A new set of complete and high spatial resolution Galactic Plane Surveys has been assembled and stored into the ViaLactea Knowledge Base (VLKB) (Molinaro et al. 2016). The VLKB storage contains files in FITS format, from 2D images in the radio continuum to 3D FITS cubes containing radio velocity spectra at specific molecular lines and, also, a collection of 3D extinction maps. Apart from the Herschel infrared Galactic Plane Survey (Molinari et al. 2010b), the core resource from which most of the primary products are derived, all of the other resources were retrieved from already public repositories or released to the community retaining their private data policy. Alongside the data collections, a relational database completes the VLKB resource content in terms of knowledge derived from the data and contains information related to 2D maps; filaments and bubbles; compact sources; and radio cubes. The available surveys can be classified into

  • 1.  
    Near-IR: UKIDSS15 Galactic Plane Survey and VISTA VVV16 Survey;
  • 2.  
    Mid-IR: The GLIMPSE17 WISE18 and MIPSGAL19 Surveys;
  • 3.  
    Far-IR: Herschel HI-GAL20 Survey;
  • 4.  
    Sub-millimeter continuum: ATLASGALCsengeri et al. (2016) and JCMT21 surveys;
  • 5.  
    Molecular and Atomic Line Surveys: the Galactic Ring Survey (GRS)22 and the International Galactic Plane Survey (IGPS)23 ;
  • 6.  
    Radio continuum: the CORNISH24 and MAGPISHelfand et al. (2006) Surveys; and
  • 7.  
    Molecular Masers: the Methanol Multi-Beam survey MMB25 .

3. Use CASES

In this section we show three exemplifying use cases where VLVA provides a key tool in all phases of scientific analysis thanks to its tight integration with the VLKB. In particular we will focus on the investigation of stars cluster formation and on the study of extended sources and filamentary structures.

VLVA combines different types of visualization to perform the analysis exploring the correlation between different data, for example 2D intensity images with 3D molecular spectral cubes. The scientist is enabled to discover the link between different physical structures, from the extended filamentary-shaped structures to the most compact, dense sources precursors of new stars.

The implementation philosophy behind VLVA is to make transparent to the scientist the access to all informations without requiring technical skills to query the VLKB. This easy accessibility to the huge amount of data is offered since the first approach with the tool to identify the candidate star formation site happens navigating a panoramic view of the Milky way. Therefore, the scientist is enabled to visually select the region of interest (with rectangular or circular selection as shown in Figure 1), retrieving it automatically (no additional technical details are needed to connect and get the data from the VLKB). The Milky Way panoramic view is obtained from the post processing of the Hi-GAL survey maps combining 70 μm (blue), 160 μm (green) and 250 μm (red) images using a novel multi-scale local stretching algorithm in false color.

Figure 1.

Figure 1. Galactic Plane navigation on the maps Herschel Hi-GAL maps and rectangular region selection to start the analysis process.

Standard image High-resolution image

The FITS files retrieved within the user selected region, in the chosen survey and wavelength, downloaded from the Search & Cutout service are displayed as shown in Figure 2 and, from this point on, it is possible to start the analysis operations.

Figure 2.

Figure 2. Map region 2D visualization. Color palette (with linear or logarithmic scale), contrast, and saturation can be customized. Pixel values and coordinates are displayed at the bottom. A list of maps and data cubes from all the available surveys within the selected region is shown in the right panel.

Standard image High-resolution image

To better visualize and deeply analyze the astronomical object, the user can perform operations that are common in scientific visualization software: palette (lookup table) can be changed to map pixel values with different colors (with linear or logarithmic scale); the opacity, contrast and saturation can be modified; layered images can be swapped. The status bar of the application shows the position of the mouse pointer over the image expressed as pixel (XY relative to the image), galactic, fk5 and ecliptic coordinates.

The access to the huge amount of information available within the user selected region is always at disposal of the scientist. Indeed, VLVA, for each visualized image, presents a list of 2D maps and 3D datacubes stored in VLKB that come from the available surveys, automatically downloaded, aligned with the displayed base image and over-imposed them as layer. This facilitates the correlation and comparison of information at different wavelengths and that comes from different instruments and space missions.

3.1. Visualization and Analysis of Compact Sources

In comparison to ground-based submillimeter continuum surveys, the data obtained by the Hi-GAL project with the Herschel cameras do not suffer from the varying atmosphere, allowing therefore to recover the rich and highly structured large-scale emission from Galactic cirrus and extended clouds. Such variable and complex background severely hinders the use of traditional methods to detect compact sources based on the thresholding of the intensity image widely used in the case of large scale millimeter and radio surveys from ground-based facilities, like the CORNISH or ATLASGAL where diffuse emission is filtered out either by the atmospheric variations or the instrumental transfer function. Therefore, a new a software tool, called CuTEx, specialized to work in severe background conditions, such as in the Hi-GAL maps, has been developed (Molinari et al. 2011). In short, CuTEx algorithm highlights those regions computing the second derivative of the map. All the "clumps" of pixels above a defined threshold are analyzed and the ones larger than a certain area are kept as candidate detections. The pixels of the large "clumps" are further analyzed to determine if there are more than one statistically significant local maximum of curvature, as expected in the case of very close sources.

The compact sources extracted with the CuTEx package at the wavelength of 70 μm, 160 μm, 250 μm, 350 μm and 500 μm have been associated basing on simple positional criteria for obtaining band merged catalog. The basic step of this process is the association between two source lists corresponding to the detections obtained at two different wavebands λ2 < λ1. An association has been established between a source detected at λ2 and one detected at λ1 if the centroid of the former falls within the ellipse returned by CuTEx as the section at half maximum of the two-dimensional fit to the source profile. Cases of multiplicity, when more than one source fulfills the aforementioned criterion, are removed saving only the association of the closest counterpart at λ1, and leaving the remaining ones unassociated. At this step, a list of entries composed by (i) associations between λ1 and λ2; (ii) unassociated sources at λ1; (iii) unassociated sources at λ2 is returned. All these entries are considered for subsequent association with a third band λ3 < λ2, and so on.

For the Hi-GAL catalog, we start with λ1 = 500 μm and we carry on the associations with λ2 = 350 μm, λ3 = 250 μm and so on, according to the scheme described above. In this way, the band merged catalog will contain sources having counterparts at all the five bands, or in fewer bands (adjacent or not), or even in a single band. The resulting catalog contains not only the photometric information, i.e., the fluxes at each band, but also the positional and morphological information about all the counterparts at all bands. For this reason, a generic source can be associated in the catalog with up to five positions, each differing by a few arcseconds; to provide the user with a reference position, we define as master position the coordinates at the shortest available wavelength.

The data mining approach, named Q-FULLTREE (Quick Full TREe on Ellipse), has been applied to obtain band merged catalogs incorporating to the Hi-GAL catalog also source catalog information from GLIMPSE, GLIMPSE360, WISE, MIPSGAL, ATLASGAL, BGPS and Cornish surveys. It is based on the same positional cross-match algorithm described above, but with a substantial difference in handling possible multiplicities.

In case there is more than one higher resolution source included in the ellipse, the value of the ellipse formula provides a by-product reliable and fast distance estimation of the object from the ellipse center that can be used to assign a different score to all the candidate counterparts included in the same ellipse. Furthermore this scoring system takes implicitly into account the relationship among candidate sources, their bands and dimensions of ellipses.

VLVA represents an invaluable tool for the study of star cluster formation (Molinari et al. 2016), offering visualization and analysis capabilities for compact sources, band-merged catalogs and Spectral Energy Distributions (SEDs).

The scientist is enabled to perform all the operations fundamental to study the compact sources by selecting a sub-region on the map. Sources from the single-band and band-merged catalogs are automatically retrieved from the VLKB and displayed the on top of the image. Compact sources are shown as ellipses with their relative FWHM and position angle using different colors for each wavelength. An example of compact sources visualization is shown in Figure 3.

Figure 3.

Figure 3. Compact Sources visualization. Compact sources are shown in different colors on the image depending on the relative wavelength.

Standard image High-resolution image

VLVA provides features to explore the physical parameters (stored into the VLKB) for each source, such as the SED (see Figure 4). Moreover, 2D plots correlating any of two selected quantities can be easily produced.

Figure 4.

Figure 4. SED analysis panel: the original SED (in dashed lines), the SED with cumulative fluxes, and the theoretical models fit result.

Standard image High-resolution image

Selecting the retrieved band-merged compact sources, the SED analysis toolset can be started. This shows an interactive 2D plot of all the counterparts at different wavelengths, obtained by the Q-FULLTREE processing, that are related to the selected sources.

In case the SED presents multiple associations the fluxes of counterparts can be summed obtaining the SED with a cumulative fluxes to be fitted (see Figure 4).

Three different kind of fitting operations are available: one for the fit with theoretical models (Robitaille et al. 2007) and two for the analytic gray-body fits (e.g., Elia & Pezzuto 2016). Fit operation is performed, by default, on all available wavelengths but a subset can be chosen. The computation is made through routines implemented using Interactive Data Language (IDL)26 that access to the data stored within the VLKB. To avoid IDL dependencies on the VLVA, this fitting engine has been implemented as a remote service to be called by the visual analytics application in a transparent way for the user. The remote engine has been optimized to minimize data transfer and computing effort on the client side. The logging information as well as the results of the computed SED fit are overplotted to the data points (see Figure 4). For theoretical fitting operation, a list of best fitted SED models is displayed once a new fit is performed. Histogram of physical parameters of the SED fitting (e.g., mass and luminosity) can be easily calculated for the selected sources. The user can select/de-select from the resulting list the SED model to visualize it on the plot.

To facilitate the scientific process and, in particular, the correlation between non-homogeneous information (e.g., SED, images, datacubes), the VLVA graphical interfaces are highly interactive. As an example, a selection on each SED point within the SED analysis toolset (Figure 4) highlights the corresponding compact source in Figure 3.

3.2. Extended Bubble-like Sources

To identify the physical agent responsible for triggering star formation in a cloud, it is necessary to search for a relationship between the nature and strength of the triggering agents, and the star formation rate (SFR) and star formation efficiency (SFE) as indicators of the intensity of star formation that they may have triggered. The new-generation Galactic Plane surveys offer now the possibility to investigate these relationships on a variety of spatial scales, from small bubbles around young H ii regions mostly visible in the mid-to-far-infrared and in the radio continuum, to arc-shaped portions of irregular H i superbubbles or SNR shell portions visible in the radio. Star-forming clumps are analyzed close to these potential triggers to verify if SFR and SFE significantly differ from other locations in the Galaxy. Still such morphological evidence alone does not constitute unequivocal proof for a triggered star formation scenario. Compatibility between the average age of a triggered star-forming region and the dynamical age of the triggering agent is another additional check that can be done using the data at hand. A good starting point in this respect can be the bubble-like structures cataloged from the surveys (GLIMPSE and MIPSGAL) obtained by the NASA's Spitzer Space Telescope.27 While GLIMPSE revealed more than 300 candidate bubbles mostly associated with hot young stars, nearly 90% of the ∼400 colder dust bubbles revealed at 24 μm by MIPSGAL are still not identified even if there are strong hints indicating they represent a population of highly obscured, massive objects in later stages of their evolution.

The visual analytic application allows to investigate these regions by superimposing different maps from the available surveys. Figure 5 shows a Hi-GAL map at 70 μm (glow palette) superimposed on a 24 μm MIPSGAL image (pure green palette). This allows to analyze dust emissions and dynamics: cold dust and large grains from the Hi-GAL map on the external ring of the bubble-like structure, and hot dust very fine grained from the MIPSGAL image in the internal part of the region.

Figure 5.

Figure 5. Bubble-like structure visualization. An Hi-GAL map at 70 μm (with glow palette) superimposed on a 24 μm MIPSGAL image (with pure green palette).

Standard image High-resolution image

Bubbles can be checked against 3D molecular spectral datacubes (e.g., from maps of GRS surveys) to study the dust kinematics (as discussed in the next Section 3.3 for filamentary structures).

3.3. Filamentary Structures

Filaments are considered key-structures required to build the densities necessary for star formation (Schisano et al. 2014). Whether driven by turbulence, magnetic field-mediated gravitational contraction, gravo-turbulent processes or shocks in large-scale Galactic flows, the transition from the diffuse and relatively low column density ISM into dense structures on the verge of star formation seems to pass through filamentary structures. Obtaining a complete census and an accurate physical characterization of dense filamentary structures over the entire extension of the Galactic Plane is a key milestone to identify the mechanisms responsible for assembling these structures and to compute the SFE.

Filaments may have highly variable lengths and widths, they may create complex web-like features and have variable orientation, but irrespectively of the pattern complexity they can be locally approximated as cylinder-like geometries and this may greatly facilitate the adoption of very specific algorithmic approaches. In particular, the use of Hessian operators has been employed on the Hi-GAL survey to identify the candidate filaments stored in the VLKB (Schisano et al. 2014).

The candidate filaments are automatically downloaded from the VLKB and displayed on top of the selected region on the base image as shown in Figure 6. Filamentary structures are visualized using a set of three shapes, identifying filaments as the primary object, branches as the "linear" components within them and their spines, and nodes as the connection points of the various branch segments that compose a filament.

Figure 6.

Figure 6. Filamentary structures visualization using a set of three shapes, identifying filaments as the primary object, branches as the "linear" components within them and their spines, and nodes as the connection points of the various branch segments that compose a filament.

Standard image High-resolution image

The visual inspection of filamentary structures can be a tricky task and can lead to misunderstanding results due to prospective effects of objects placed at different distances, therefore not physically connected. Thus, 2D visualization should be extended and validated with other information, in particular, those coming from the inspection of 3D molecular spectral datacubes to verify that a candidate filament belongs to the same kinematic structure (see Figure 7).

Figure 7.

Figure 7. Datacube visualization and isocontours mapping on filamentary structures. This visualization is employed to verify that a candidate filament belongs to the same kinematic structure.

Standard image High-resolution image

The molecular datacube, containing the velocity information, can be selected from the available surveys listed on the visualization window of the VLVA as shown in Figure 2. The datacube 3D reconstruction is rendered using isosurfaces algorithm with chosen thresholds (see Figure 7(a)). The right panel in Figure 7(a) shows a 2D representation of the velocity channel, selected with the cutting plane from the left panel, and related isocontours that can be changed by selecting the number of levels and the upper/lower bounds.

Thanks to the VLVA interactivity between different visualization windows, the isocontours are also placed as additional layer on top of the map shown in Figure 6 to see the velocity mapping within the filamentary structures (see Figure 7(b)).

4. Relevant Technologies

The VIALACTEA Visual Analytics is implemented as a cross-platform client-server application interacting (in a transparent way for the user) with the VLKB and the discovery and access interfaces as shown in Figure 8.

Figure 8.

Figure 8. Visual Analytics application interacting with the ViaLactea Knowledge Base (VLKB). VLKB exposes data and metadata trough the TAP service and the search & cutout service.

Standard image High-resolution image

The core functionalities of VLVA are implemented in C++ and extended with Qt28 for the graphical user interface. Qt enables a tight interactivity between different visualization windows thanks to its powerful events handling.

The adopted rendering engine is the Visualization Toolkit (VTK).29 VTK is used worldwide in commercial applications, research and development, and is the basis of many advanced visualization applications such as ParaView30 VisIt,31 and VisIVO. VTK supports a wide variety of visualization algorithms including scalar, vector, tensor, texture, and volumetric methods; and advanced modeling techniques such as implicit modeling, polygon reduction, mesh smoothing, cutting and contouring. VTK includes an extensive information visualization framework and a suite of 3D interaction widgets; it supports parallel processing and integrates with various databases and GUI toolkits such as Qt. VTK is an open-source toolkit licensed under the BSD license.

The VLVA overall data analysis and management is based on VisIVO Library (Becciani et al. 2012) that makes direct use of VisIVO features mainly for the importing of tabular and 2D/3D images and filtering facilities (such as mathematical operations between physical parameters and computation of statistical information).

The access to VO-compatible databases and file store within the VLKB is performed using the Table Access Protocol (TAP)32 interfaces. The interoperability with the other VO-based tools (e.g., Aladin or TOPCAT) is guaranteed thanks to the implementation of the Simple Application Messaging Protocol (SAMP) (see footnote 6).

As previously said, apart from the Hi-GAL surveys, the other resources were retrieved from public repositories or released to the community retaining their private data policy. Therefore the access to the VLKB infrastructure has been secured via an authentication layer (currently based on HTTP basic access authentication). Alongside the data collections, a relational database (RDB) completes the VLKB resource content in terms of knowledge derived from the data and contains information related to: filaments and bubbles, compact sources and radio cubes.

All the data and metadata contents are discoverable and accessible thanks to the interfaces that have been put on top of the database and storage system: a TAP service and a set of dedicated interfaces to consume stored data sets. While the TAP interface is useful for generic tabular access, it is less powerful to perform positional discovery and is not designed to provide direct access to data files. Therefore the search & cutout service has been implemented to

  • 1.  
    identify the map images that overlap a region along a specific line of sight on the celestial sphere and described by a circle or a rectangular range around it;
  • 2.  
    cutout the data files on positional constraints and also on the velocity axis (if 3D cubes are investigated) to allow for efficient data transport on the net and work only on the significant part of data; and
  • 3.  
    merge adjacent data sets into a single image/cube based on positional and velocity bounds.

5. Summary and Future Works

We presented a visual analytics application that allows to combine 2D and 3D visualization with knowledge discovery and data mining methodologies accessing to new-generation surveys of the Galactic Plane to facilitate the studies of stars formation structures within our Galaxy.

The application has been tailored to analyze compact sources with SEDs and extended sources such as bubbles and filaments. We demonstrated, through use cases, how the tool can easily support these analyses offering both qualitative and quantitative visualization capabilities.

Possible extensions to the current implementation that we are considering, include (1) enabling multiple users to interact with the same data set simultaneously, which will enable collaborative data analysis and visualization; (2) further optimizing the rendering and processing time using e.g., GPU technologies as in (Hassan et al. 2013); and (3) investigating the use of European Open Science Cloud (EOSC) technologies for archive services related to the VLKB and intensive SED analysis employing the connection with the ViaLactea Science Gateway (Sciacca et al. 2017).

The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement No. 607380 (VIALACTEA) and from the European Commissions Horizon 2020 research and innovation programme under the grant agreement No. 739563 (EOSCPilot.eu).

Footnotes

Please wait… references are loading.
10.1088/1538-3873/aac5d2