Anecdotal evidence suggests that the variety of Big data is one of the most challenging problems in Computer Science research today [Stonebraker, 2012], [Ou et al., 2017], [Guo et al., 2016], [Bai et al., 2016]. First, Big data comes at us from a myriad of data sources, hence its shape and flavor differ. Second, hundreds of data management systems which work with Big data support different APIs and...
High-level synthesis (HLS) is well suited to generating control and computation circuits for FPGA accelerators, but still requires substantial human effort to tackle memory and communication bottlenecks. One important approach for improving data locality is to apply loop tiling to memory-intensive loops. Loop tiling is a well-known compiler technique that partitions the iteration...
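As a rough illustration of loop tiling in general (not the HLS flow of the abstract above), the following minimal Python sketch partitions the iteration space of a matrix multiplication into small blocks so that each block reuses the same few rows and columns. The function name `matmul_tiled` and the `tile` parameter are hypothetical and chosen only for this example.

```python
def matmul_tiled(a, b, tile=2):
    """Multiply two square matrices (lists of lists) using loop tiling."""
    n = len(a)
    c = [[0] * n for _ in range(n)]
    # The three outer loops step over tiles of the iteration space.
    for ii in range(0, n, tile):
        for kk in range(0, n, tile):
            for jj in range(0, n, tile):
                # The three inner loops stay inside one tile; min() clips
                # the last tile when n is not a multiple of the tile size.
                for i in range(ii, min(ii + tile, n)):
                    for k in range(kk, min(kk + tile, n)):
                        aik = a[i][k]
                        for j in range(jj, min(jj + tile, n)):
                            c[i][j] += aik * b[k][j]
    return c
```

The result is the same as that of the untiled triple loop; only the order in which iterations are visited changes, which is what improves data locality on real hardware.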
A heterogeneous memory system (HMS) consists of multiple memory components with different properties. The GPU is a representative architecture with an HMS. Deciding the optimal placement of data objects on an HMS is challenging because of the large exploration space and the complicated memory hierarchy. In this paper, we introduce performance modeling techniques to predict performance of various data placements...
Efficiently reading data from and writing data to storage systems is necessary for most scientific simulations to achieve good performance at scale. Many software solutions have been developed to reduce the I/O bottleneck. One well-known strategy, in the context of collective I/O operations, is the two-phase I/O scheme. This strategy consists of selecting a subset of processes to aggregate contiguous pieces...
Environmental science is currently experiencing tremendous growth in raster data: N-dimensional (N-d) arrays coming mainly from numeric simulation and Earth remote sensing. An array DBMS is a tool to streamline raster data processing. However, raster data are usually stored in files, not in databases. Moreover, numerous command line tools exist for processing raster files. This paper describes a distributed...
Most GPU-based graph systems cannot handle large-scale graphs that do not fit in the GPU memory. The ever-increasing graph size demands a scale-up graph system, which can run on a single GPU with optimized memory access efficiency and well-controlled data transfer overhead. However, existing systems either incur redundant data transfers or fail to use shared memory. In this paper we present Graphie,...
The accuracy of the PV array model directly affects the output characteristics of the PV system. In this paper, a simplified engineering model for the PV array is established, and the influence on the output characteristics of the PV array is analyzed when Rs, Rsh and α, β, γ change. Based on the actual data of a PV power station, the shuffled frog leaping algorithm (SFLA) is adopted to identify the parameters...
The photovoltaic (PV) array is one of the main components of the PV system, and the accuracy of the PV array model is directly related to the validity of the simulation results. The parameters of the PV array may change with the operation conditions. Therefore, it is important to identify the parameters of the PV array model according to the measured data. In this paper, the conventional four-parameter...
Computing machine learning models in the cloud remains a central problem in big data analytics. In this work, we introduce a cloud analytic system exploiting a parallel array DBMS based on a classical shared-nothing architecture. Our approach combines in-DBMS data summarization with mathematical processing in an external program. We study how to summarize a data set in parallel assuming a large number...
This paper proposes the most efficient I/O-based out-of-core stencil algorithm for large-capacity types of non-volatile memory (NVM), such as flash. The paper evaluates the performance of various out-of-core stencil algorithms and implementations designed for flash. The algorithms for flash are very different from existing algorithms designed for memory-and-cache, host-and-GPU, and local-and-remote...
In an organisation several Business Process Management System (BPMS) products can co-exist and work alongside each other. Each one of these BPM tools has its own definition of process instances, creating a heterogeneous environment. This reduces interoperability between business process management systems and increases the effort involved in analysing the data. In this paper, we propose a common model...
Per-flow counting for big network data streams is a fundamental problem in various network applications such as traffic monitoring, load balancing, capacity planning, etc. Traditional research focused on designing compact data structures to estimate flow sizes from the beginning of the data stream (i.e., landmark window model). However, for many applications, the most recent elements of a stream are...
A method is proposed to extract an equivalent dipole-moment array model from only the magnitude data of a near-field scan. The method can extract both the magnitudes and the phases of the equivalent dipole moments by using the nonlinear least squares (NLS) method, although the information used for source modeling is limited to the field magnitudes. The truncated singular-value decomposition (SVD) method...
Finding the best model to reveal potential relationships in a given set of data is not an easy job and often requires many iterations of trial and error for model selection, feature selection, and parameter tuning. This problem is greatly complicated in the big data era, where I/O bottlenecks significantly increase the time needed to find the best model. In this article, we examine the case...
Spectroscopy is an important component analysis method, but full-spectrum prediction may be complicated and inaccurate. In order to identify the irrelevant variables, a variable selection method based on the frequent pattern tree (FP-tree) is proposed in this paper. The proposed method first formulates an orthogonal array to generate wavelength selection plans, which makes the wavelength selection...
Heterogeneous computing systems, e.g., those with accelerators in addition to the host CPUs, offer accelerated performance for a variety of workloads. However, most parallel programming models require platform-dependent, time-consuming hand-tuning efforts to collectively use all the resources in a system and achieve efficient results. In this work, we explore the use of OpenMP parallel language extensions...
High-performance distributed memory applications often load or receive data in a format that differs from what the application uses. One such difference arises from how the application distributes data for parallel processing. Data must be redistributed from how it was laid out by the producer to how the application needs the data to be laid out amongst its processes. In this paper, we present a large-scale...
Recently, a massive amount of data has been generated in a wide range of scientific applications such as NASA's satellites, the Large Hadron Collider, and the Large Synoptic Survey Telescope. Most scientific data follow the array model, and there are various standard array formats such as HDF, NetCDF, MDSplus, and ROOT. SciDB is the most well-known DBMS that stores the array-based scientific data...
Advancements in high-level integration of MMICs in package have led to the development of a system comprising a multi-channel transceiver with antenna in package (AiP). Multi-channel systems provide an additional advantage by enabling digital beamforming for direction-of-arrival estimation at 60 GHz. Different beamforming techniques are studied and practically implemented with a radar platform to find...
The ability to perform a top-level simulation of a processor is hindered by simulation performance, which can be much slower than unit-level test benches. Simulation performance is also greatly reduced by the presence of an increasing number of cores in a top-level test bench. In this paper, we present the implementation of the Fake CPU UVC, a bus functional model which emulates a real CPU's behavior...