Content

This dataset comprises measurements collected from six low-speed synchronous generators, denoted A through F, located in different hydroelectric power plants operated by EDF. For each machine, 15 operating points were recorded, giving a total of 90 acquisitions (instances) across the six generators. Each instance is described by 16 columns: 14 frequency-domain features (ranging from 3.3 Hz to 46.7 Hz), a binary label y (where 0 indicates the absence of an inter-turn short circuit (ITSC) and 1 indicates its presence), and the machine identifier (A–F).

Machine characteristics

The main characteristics of the machines are reported below.

| Characteristic | Machines A, B, C | Machines D, E, F |
|---|---|---|
| Rated active power (MW), $P_n$ | 55 | 65 |
| Rated reactive power (MVAR), $Q_n$ | 35 | 50 |
| Rated stator current (A) | 3000 | 3700 |
| Number of poles, $p$ | 30 | 30 |
| Rotational speed (rpm) | 200 | 200 |
| Supply voltage (V) | 10000 | 10000 |
| Frequency (Hz), $m$ | 50 | 50 |

Data acquisition

The dataset was collected by measuring the stray magnetic field around each generator using a tri-axial flux sensor. This sensor consists of three orthogonal coils aligned along the normal (NOR), axial (AXI), and tangential (TAN) directions, and is attached to the external frame of the machine using magnetic supports. The sensor is directly connected to the acquisition system, without additional signal conditioning.

In this study, only the tangential (TAN) component is considered. This component, oriented perpendicular to the machine’s longitudinal axis, is less sensitive to axial magnetic effects (e.g., from winding ends) and provides an indirect measurement of the radial air-gap flux, although attenuated by the stator structure.

Measurements were conducted over 15 operating points per machine, defined by combinations of three active power levels ($10\%$, $50\%$, and $100\%$ of the nominal active power $P_n$) and five reactive power levels ($-100\%$, $-50\%$, $0\%$, $+50\%$, and $+100\%$ of the nominal reactive power $Q_n$).
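As an illustration of how the 15 operating points per machine arise, the following minimal Python sketch enumerates the 3 × 5 grid of active and reactive power setpoints. The rated values come from the characteristics table; the function and variable names are hypothetical and are not part of the dataset.

```python
from itertools import product

# Fractions of rated power stated in the description.
ACTIVE_LEVELS = (0.10, 0.50, 1.00)                 # share of P_n
REACTIVE_LEVELS = (-1.00, -0.50, 0.00, 0.50, 1.00)  # share of Q_n

# Rated values from the characteristics table (MW, MVAR).
RATED = {
    "A": (55.0, 35.0), "B": (55.0, 35.0), "C": (55.0, 35.0),
    "D": (65.0, 50.0), "E": (65.0, 50.0), "F": (65.0, 50.0),
}

def operating_points(p_n_mw, q_n_mvar):
    """Return the (active MW, reactive MVAR) setpoints for one machine."""
    return [
        (a * p_n_mw, r * q_n_mvar)
        for a, r in product(ACTIVE_LEVELS, REACTIVE_LEVELS)
    ]

# Hypothetical helper structure: setpoints for every machine.
all_points = {name: operating_points(*rated) for name, rated in RATED.items()}
total_instances = sum(len(points) for points in all_points.values())
print(len(all_points["A"]), total_instances)
```

Each machine yields 15 setpoints, so the six machines together account for the 90 instances in the dataset.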
For each operating condition, the sensor signals were recorded using a National Instruments PXI system controlled via a LabVIEW interface for raw data acquisition. Each recording spans 5 seconds at a sampling rate of 20 kHz, producing a univariate time series of length $T = 100\,000$. These raw time series are not available in this dataset.

Data transformation

The raw time-series signals were converted into frequency-domain features using the discrete Fourier transform (DFT). Only harmonics at integer multiples of the fundamental electrical frequency $m$ (50 Hz) divided by the number of pole pairs $\frac{p}{2}$ (15 in this case), that is, multiples of $\frac{m}{p/2} \approx 3.33$ Hz, and falling within the lower sideband (i.e., below the fundamental frequency) were retained for analysis. The feature set is constructed by normalizing the amplitudes of these harmonics with respect to the fundamental component. This procedure results in a feature vector of 14 normalized frequency components for each signal.
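The transformation described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the authors' processing code: the function names are hypothetical, the Fourier component is evaluated directly at each harmonic frequency, no windowing is applied, and the demo signal is synthetic and uses a shorter, lower-rate recording (3 s at 1.5 kHz) than the 5 s at 20 kHz recordings of the dataset.

```python
import math

M_HZ = 50.0       # fundamental electrical frequency m (from the table)
POLE_PAIRS = 15   # p / 2 with p = 30 poles
N_FEATURES = 14   # harmonics lying below the fundamental

def harmonic_frequencies():
    """Frequencies k * m / (p/2) for k = 1..14 (about 3.3 Hz to 46.7 Hz)."""
    return [k * M_HZ / POLE_PAIRS for k in range(1, N_FEATURES + 1)]

def amplitude_at(signal, freq_hz, fs_hz):
    """Amplitude of the Fourier component of `signal` at a single frequency."""
    n = len(signal)
    w = 2.0 * math.pi * freq_hz / fs_hz
    re = sum(x * math.cos(w * i) for i, x in enumerate(signal))
    im = sum(x * math.sin(w * i) for i, x in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n

def extract_features(signal, fs_hz):
    """Harmonic amplitudes normalized by the amplitude of the 50 Hz component."""
    fundamental = amplitude_at(signal, M_HZ, fs_hz)
    return [
        amplitude_at(signal, f, fs_hz) / fundamental
        for f in harmonic_frequencies()
    ]

# Synthetic demo (assumed values): unit 50 Hz component plus a first
# sub-harmonic at 50/15 Hz with relative amplitude 0.2.
fs = 1500.0
n_samples = int(fs * 3.0)
signal = [
    math.sin(2.0 * math.pi * M_HZ * i / fs)
    + 0.2 * math.sin(2.0 * math.pi * (M_HZ / POLE_PAIRS) * i / fs)
    for i in range(n_samples)
]
features = extract_features(signal, fs)
print([round(v, 3) for v in features])
```

In this demo every harmonic completes a whole number of cycles within the 3 s window, so the first feature recovers the injected relative amplitude and the others are close to zero. With a 5 s recording the frequency resolution is 0.2 Hz and most harmonics do not fall on exact DFT bins, so a real pipeline would need some choice about bin selection or windowing; the description does not specify one. For full-length recordings an FFT routine such as `numpy.fft.rfft` would normally replace the direct sums used here.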