The success of deep learning in vision can be attributed to: (a) models with high capacity; (b) increased computational power; and (c) availability of large-scale labeled data. Since 2012, there have been significant advances in representation capabilities of the models and computational capabilities of GPUs. But the size of the biggest dataset has surprisingly remained constant. What will happen...
In this paper, we consider a millimeter wave (mmWave) multiple-input multiple-output (MIMO) communication system with a hybrid beamforming architecture. Searching for the optimal beam pair with the beam sweeping algorithm is complex and time-consuming during initial access and beam switching. With the aim of improving the robustness, we propose a novel beam management scheme to quickly determine the...
This paper focuses on temporal localization of actions in untrimmed videos. Existing methods typically train classifiers for a pre-defined list of actions and apply them in a sliding window fashion. However, activities in the wild consist of a wide combination of actors, actions and objects; it is difficult to design a proper activity list that meets users’ needs. We propose to localize activities...
Temporal Action Proposal (TAP) generation is an important problem, as fast and accurate extraction of semantically important (e.g. human actions) segments from untrimmed videos is an important step for large-scale video analysis. We propose a novel Temporal Unit Regression Network (TURN) model. There are two salient aspects of TURN: (1) TURN jointly predicts action proposals and refines the temporal...
This paper introduces the concept of a pseudo electrostatic spring constant and proposes a new system architecture that incorporates it in an electromechanical sigma-delta modulator (EM-SDM). The zeros in the quantization noise transfer function, corresponding to the sensor mechanical poles in the proposed system architecture, can be adjusted and optimized without affecting the sigma-delta closed-loop...
Rich and dense human-labeled datasets are among the main enabling factors for the recent advances in vision-language understanding. Many seemingly distant annotations (e.g., semantic segmentation and visual question answering (VQA)) are inherently connected in that they reveal different levels and perspectives of human understanding of the same visual scenes, and even the same set of images (e...
In this work, we present WaveLight, a monolithic silicon-photonics platform whereby a low latency reliable deterministic protocol with optical functions are designed directly into an existing high-volume CMOS process.
Electrical stimulation therapy (EST) of lower esophageal sphincters (LES) is a new technique for the treatment of gastroesophageal reflux disease (GERD). In this paper, an implantable LES stimulator with wireless power transmission is proposed for the treatment of GERD. The LES stimulator is composed of an implantable pulse generator (IPG), an external controller, and a wireless power transmission...
The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. To this end, we investigate various ways to trade accuracy for speed and memory usage in modern convolutional object detection systems. A number of successful systems have been proposed in recent years, but apples-to-apples...
Robots have been receiving increasing attention, and robots in education are considered a promising aid for teaching and learning in many ways. This study investigates the need for educational robots among six different user groups (pre-school, primary school, high school, college, adult, and elderly users). A triangulation technique was applied for cross validation,...
In this paper a side-channel-attack resistant AES system with a variation-tolerant true Random Number Generator (tRNG) is implemented using IBM 0.13μm CMOS technology. As the random source for the AES, a meta-stability based tRNG takes advantage of an all-digital self-calibration method to compensate Process-Voltage-Temperature (PVT) variations, and thus guarantees output with extremely high randomness...
In this work, we provide an overview of the technology and architecture of a microprocessor chip with optical I/O. Zero-change photonics integration enabled the chip to be fabricated in a commercial electronics CMOS foundry.
The huge training overhead for obtaining channel state information (CSI) at the BS has been recognized as a major challenge in frequency division duplex (FDD) massive multiple-input multiple-output (MIMO) cellular networks. To solve this problem, we propose an angular domain pilot design and channel estimation scheme to reduce the required overhead by exploiting the angle domain channel sparsity....
Outdoor sports equipment that is not maintained often causes injury to users. This paper proposes an app notification system combined with IoT to monitor the equipment status. We converted the action signal into the frequency domain for analysis. However, useless signal components might reduce classifier efficiency. Therefore, we propose an empirical support vector machine (SVM) to reduce the useless signal...
We propose to leverage concept-level representations for complex event recognition in photographs given limited training examples. We introduce a novel framework to discover event concept attributes from the web and use that to extract semantic features from images and classify them into social event categories with few training examples. Discovered concepts include a variety of objects, scenes, actions...
This paper presents an improved ensemble Kalman filter for sound speed profile inversion using an autonomous underwater vehicle carrying a source. The Markov chain Monte Carlo method is introduced to improve the variability of ensemble members in the traditional ensemble Kalman filter and thus improve performance. The inverse problem is formulated as a state-space model with a state equation for...
Physical layer secret key generation techniques have spurred much recent research interest. In general, secret key establishment comprises four main procedures: channel sounding, quantization, information reconciliation and privacy amplification. However, to overcome the non-reciprocity and correlation problems in key generation from orthogonal frequency division multiplexing (OFDM) channels,...
As the de facto data plane technique of Software-Defined Networking (SDN), OpenFlow introduces significant programmability to enable innovative network applications. However, the simple OpenFlow data plane only maintains flow-level counters and lacks an efficient mechanism to manage network-level states, which limits its support for advanced state-aware applications. Regularly pulling whole state...
A high-sensitivity, fully-differential optical receiver for high-density photonic interconnects is presented. To realize fully-differential operation, a 3-dB power splitter and SiGe photodetector are integrated with the receiver, all in a CMOS 45nm SOI process. The proposed receiver improves sensitivity by suppressing common-mode and supply noise through fully-differential (FD) operation, achieving...
Palette mode is the new coding tool that has been adopted in the Screen Content Coding Extensions of High Efficiency Video Coding (HEVC SCC). Palette mode can represent colour clusters for screen content efficiently and can be summarized into two parts: palette coding tools and colour index map coding tools. This paper proposes two techniques to improve colour index map coding: transition copy and...