Measurement of the Low-x Behaviour of the Photon Structure Function F2-gamma

The photon structure function F2-gamma(x,Q**2) has been measured using data taken by the OPAL detector at centre-of-mass energies of 91Gev, 183Gev and 189Gev, in Q**2 ranges of 1.5 to 30.0 GeV**2 (LEP1), and 7.0 to 30.0 GeV**2 (LEP2), probing lower values of x than ever before. Since previous OPAL analyses, new Monte Carlo models and new methods, such as multi-variable unfolding, have been introduced, reducing significantly the model dependent systematic errors in the measurement.


Introduction
The measurement of the hadronic photon structure function F γ 2 (x, Q 2 ) is a classic test of QCD predictions [1][2][3]. The structure function of the photon differs from that of the proton because the photon can couple directly to quark charges, as well as fluctuate into a hadronic state. The value of F proton 2 (x, Q 2 ) exhibits a clear rise towards low values of Bjorken x [4,5]. Some theoretical models predict a similar rise in the photon structure function [6][7][8], while other models do not require such a rise [9]. Experimentally, a rise at low-x in the photon structure function has neither been observed nor excluded [10][11][12][13][14][15][16][17][18][19]. The photon structure function is studied at LEP using samples of events of the type e + e − → e + e − γ ⋆ γ → e + e − + hadrons. The analysis presented here uses single-tagged events (from here on referred to as γ ⋆ γ events), which means that only one of the scattered beam electrons 1 is observed in the detector. These events can be regarded as deep inelastic scattering of an electron off a quasi-real target photon, and the flux of quasi-real photons can be calculated using the equivalent photon approximation [1]. Figure 1 shows a diagram of this reaction, in which k is the fourvector of the incoming electron which radiates the virtual photon, and q and p are the four-vectors of the virtual photon and the quasi-real photon, respectively. The symbol f q,γ (x, Q 2 ) represents the parton densities of the quasi-real photon. The cross-section for deep inelastic electron-photon scattering can be written in terms of structure functions as [3] where Q 2 = −q 2 is the negative value of the four-momentum squared of the virtual probe photon, α is the fine structure constant and x and y are the usual dimensionless deep inelastic scattering variables, defined by In the kinematic region studied in this paper, y 2 ≪ 1, so the contribution from the longitudinal photon structure function F γ L in Equation 1 is neglected.
In contrast to measurements of the proton structure function, here the energy of the target particle is not known. In consequence, the kinematics cannot be fully determined without measuring the hadronic final state, which is only partially observed in the 1 For conciseness, positrons are also referred to as electrons.
detector. This leads to a dependence of the F γ 2 measurement on Monte Carlo modelling of the hadronic final state, which enters when an unfolding process is used to relate the visible distributions to the underlying x distribution.
The analysis presented here uses OPAL data collected during the years 1993-1995, 1997 and 1998, at e + e − centre-of-mass energies of 91 GeV (LEP1), and 183 GeV and 189 GeV (LEP2). This is the first OPAL measurement of the four-flavour photon structure function using the 183 GeV and 189 GeV data, though the charm contribution to F γ 2 has already been measured [20]. In this analysis new Monte Carlo programs and improved unfolding methods have been introduced, which are also used in the re-analysis of the LEP1 data.

The OPAL detector
The OPAL detector is described in detail elsewhere [21]; only the subdetectors which are most relevant for this analysis, namely the low angle electromagnetic calorimeters and the tracking devices, are detailed below. The OPAL detector has a uniform magnetic field of 0.435 T along the beam direction throughout the central tracking region, with electromagnetic and hadronic calorimetry and muon chambers outside the coil.
The forward detectors (FD) cover the θ region from 60 to 140 mrad at each end of the OPAL detector 2 . They consist of cylindrical lead-scintillator calorimeters with a depth of 24 radiation lengths (X 0 ) divided azimuthally into 16 segments. The energy resolution for electromagnetic showers is 18%/ √ E, where E is in GeV. An array of three planes of proportional tubes buried in the calorimeter at a depth of 4 X 0 provides a precise shower position measurement, with a typical resolution of 3-4 mm, corresponding to 2.5 mrad in θ, and less than 3.5 mrad in φ.
The small-angle silicon tungsten luminometer (SW) covered the region in θ from 25 to 60 mrad from 1993-1995. For LEP2 running, a radiation shield was installed which moved the lower edge of the useful SW acceptance to 33 mrad. The SW detector contains 19 layers of silicon alternating with tungsten. Each of the 16 azimuthal wedges is divided into 64 pads for positional measurement. The energy resolution of the SW detector is about 24%/ √ E at LEP1 energies, and about 6% at LEP2.
Charged particles are detected by a silicon microvertex detector, a drift chamber vertex detector, and a jet chamber. Outside the jet chamber, but still inside the magnetic field, lies a layer of drift chambers whose purpose is to improve the track reconstruction in the z-coordinate. The resolution of the transverse 3 momentum for 2 In the OPAL right-handed coordinate system the x-axis points towards the centre of the LEP ring, the y-axis points upwards and the z-axis points in the direction of the electron beam. The polar angle θ and the azimuthal angle φ are defined with respect to the z-axis and x-axis, respectively. 3 Transverse is always defined with respect to the z-axis.
charged particles is σ pt /p t = 0.02 2 + (0.0015 p t ) 2 for | cos θ| < 0.7 , where p t is in GeV, and degrades for higher values of | cos θ|. Tracks are accepted up to a limit of | cos θ| < 0.9622.
Both ends of the OPAL detector are equipped with electromagnetic endcap calorimeters covering the range from 200 to 630 mrad in polar angle. They are homogeneous devices composed of arrays of lead-glass blocks of 9.2 × 9.2 cm 2 cross-section and typically 22 X 0 in depth, giving good shower containment. In the central region, outside the solenoid, is the electromagnetic barrel calorimeter of similar construction.
The deep inelastic scattering events are triggered with high efficiency by the large energy deposits of the scattered electron in the forward calorimeters (FD and SW) and by charged particles seen in the tracking devices.

Kinematics and data selection
To measure F γ 2 (x, Q 2 ), the distribution of events in x and Q 2 is needed. These variables, illustrated in Figure 1, are related to the experimentally measurable quantities and x = Q 2 Q 2 + W 2 + P 2 (4) where E b is the energy of the beam electrons, E tag and θ tag are the energy and polar angle of the deeply inelastically scattered (or 'tagged') electron, W 2 is the invariant mass squared of the hadronic final state and P 2 = −p 2 is the negative value of the virtuality squared of the quasi-real photon. The requirement that the associated electron is not seen in the detector ensures that P 2 ≪ Q 2 , so P 2 is neglected when calculating x from Equation 4. The electron mass is neglected throughout.
Three samples of events are studied in this analysis, classified according to the centre-of-mass energy and the subdetector in which the tagged electron is observed. Data from LEP1 are used, with e + e − centre-of-mass energies close to the Z 0 mass, as well as LEP2 data with centre-of-mass energies of 183 GeV and 189 GeV. Electrons are tagged using the SW detector at all centre-of-mass energies, since accessing the lowest possible x region requires measuring the electrons scattered at the lowest possible angles. The FD is only used for the LEP1 data, to provide a sample in the same range of Q 2 as the LEP2 sample, but using a different subdetector for tagging the scattered electron. The three samples used are termed LEP1 SW, LEP1 FD and LEP2 SW. Each sample is further split into two bins of Q 2 .
Events are selected by applying cuts on the scattered electrons and on the hadronic final state. The cuts are listed in Table 1. A tagged electron within the clear acceptance of SW or FD is selected by requiring E tag ≥ 0.75 E b at LEP1 or E tag ≥ 0.775 E b at LEP2. This cut effectively eliminates events originating from random coincidences between off-momentum 4 beam electrons faking a tagged electron and untagged γγ events. The cut is higher at LEP2 because of the larger off-momentum background in the 183 GeV data. To ensure that the virtuality of the target photon is small, the highest energy cluster in the hemisphere opposite the tagged electron must have an energy E a ≤ 0.25 E b (the anti-tag condition). To reject background from γ ⋆ γ events with leptonic final states, the number of tracks in the event passing quality cuts and originating from the hadronic final state, N ch , must be at least three, of which at least two tracks must not be identified as electrons. The tracks and the calorimeter clusters are reconstructed using standard OPAL techniques [22] which avoid double counting of particles which produce both tracks and clusters. Finally, the visible invariant mass W vis of the hadronic system based on tracks and calorimeter clusters and including the contribution from energy measured in the forward calorimeters (FD and SW) is required to be in the range 2.5 GeV ≤ W vis ≤ 40 GeV for the LEP1 sample and 2.5 GeV ≤ W vis ≤ 60 GeV for the LEP2 sample. The distribution of W vis extends to higher values at LEP2 because more energy is available from the beam electrons. In addition, the background from Z 0 → hadrons, which the maximum W vis cut is designed to reject, is lower at LEP2 than at LEP1.
The trigger efficiencies were evaluated from the data using sets of separate triggers and found to be larger than 99% for all of the samples.
The cuts applied to each sample are listed in Table 1 and shown as dotted lines in Figures 2-7. The numbers of events in each sample passing the cuts, the integrated luminosities and the Q 2 ranges are listed in Table 2. The luminosity for the LEP1 data is lower for the SW sample than for the FD sample because the SW sample does not include data from 1995.

Monte Carlo and background
Monte Carlo programs are used to simulate γ ⋆ γ events and to provide background estimates. All Monte Carlo events are passed through the OPAL detector simulation [23] and the same reconstruction and analysis chain as real events.
It has been seen in previous studies [16,18,19] that Monte Carlo modelling of the hadronic final state is a large source of systematic error in the F γ 2 measurement. While there is still no γ ⋆ γ Monte Carlo generator which describes all the features of the observed hadronic final state, improved Monte Carlo programs have become available since the previous OPAL measurements of F γ 2 and it is now possible to reject some of the models which do not describe the data adequately.
The Monte Carlo generators used to simulate signal events are HERWIG 5.9 [24], HERWIG 5.9+k t (dyn) [25], PHOJET 1.05 [26], and F2GEN [27]. HERWIG 5.9 is a general purpose Monte Carlo program which includes deep inelastic electron-photon scattering. The HERWIG 5.9+k t (dyn) version uses a modified transverse momentum distribution, k t , for the quarks inside the photon, with the upper limit dynamically (dyn) adjusted according to the hardest scale in the event, which is of order Q 2 . The distribution was originally tuned for photoproduction events at HERA [28]. PHO-JET 1.05 simulates hard interactions through perturbative QCD and soft interactions through Regge phenomenology and thus is expected to provide a more complete picture of γ ⋆ γ collisions than HERWIG 5.9. It is used here for the first time in an OPAL F γ 2 analysis. In F2GEN, γ ⋆ γ events are generated with a pure qq final state. No QCD effects between the two quarks are simulated, and the pointlike mode was used, in which the angular distribution of the qq final state is taken to be the same as for a lepton pair. The generated luminosities of the Monte Carlo samples were 3-6 times the data luminosity.
A sample was generated with each of the Monte Carlo models using the GRV LO [6] parameterisation of F γ 2 , taken from the PDFLIB library [29], as the input structure function. This version assumes massless charm quarks. To study the effect of a different input structure function on the final state, another sample was generated using HERWIG 5.9 5 with the SaS1D [7] parameterisation of F γ 2 .
All of the Monte Carlo programs except PHOJET 1.05 allow generation of events and cross-section calculation according to a chosen structure function. PHOJET 1.05 uses an input structure function for the total cross-section calculation but always produces the same x and Q 2 distributions. The measurement of F γ 2 requires knowledge of the structure function used to generate these distributions, so the effective internal structure function in PHOJET 1.05 was found by comparing the generator level distributions to those of HERWIG 5.9+k t (dyn) with the GRV LO structure function. The ratio of the x distributions in the two samples for each Q 2 range was fitted to a polynomial, which, after multiplying by the GRV LO structure function, gave the structure function for the PHOJET 1.05 sample.
The dominant background comes from the reaction e + e − → e + e − γ ⋆ γ → e + e − τ + τ − proceeding via the multiperipheral process shown in Figure 1, with e + e − → e + e − γ ⋆ γ → e + e − e + e − giving a smaller contribution. These were simulated using the Vermaseren program [30]. The process Z 0 → hadrons also contributes significantly at LEP1 and was simulated using JETSET [31]. Because the aim is to measure the structure function of the quasi-real photon, the hadronic γ ⋆ γ ⋆ events are also treated as background. They were generated using PHOJET 1.05 with the virtuality of the target photon restricted to P 2 >1.0 GeV 2 (LEP1) or 4.5 GeV 2 (LEP2). Other sources of background considered were Z 0 → τ + τ − , simulated with KORALZ [32], non-multiperipheral fourfermion events with e + e − τ + τ − and e + e − qq final states, which were simulated with FERMISV [33], and W + W − → hadrons and untagged γγ events, which were both simulated with PYTHIA [34]. The contributions from all these were found to be negligible in all the samples. The number of expected signal and background events for each data sample is shown in Table 3.

Figures 2, 3 and 4 show comparisons between data and Monte
Carlo distributions for variables relating to the scattered electrons. The main differences are in the LEP1 SW data sample, where the observed cross-section for selected events is significantly higher than either of the Monte Carlo predictions. The LEP1 SW data sample is more peaked than the HERWIG 5.9+k t (dyn) prediction at low θ tag , and there are large discrepancies between the LEP1 SW data sample and the Monte Carlo samples in the distribution of E a /E b (Figure 2d), with the data having an excess at the low end of the plot. However, in this region, the anti-tag distribution is more influenced by the hadronic final state than by the anti-tagged electrons, which are almost all above the cut.
In the distributions of variables related to the hadronic final state ( Figures 5-11), there are large discrepancies. Figure 8 shows the hadronic energy flow as a function of pseudorapidity. It can be seen that HERWIG 5.9 tends to put too little energy in the central region of the detector, while F2GEN has a much higher peak in the central region than the data. PHOJET 1.05 gives the best description of the data in these plots.
where i runs over all particles in the hadronic final state, with transverse energy E i,t . The azimuthal angle φ i is measured from the tagged electron. HERWIG 5.9 has too few events at large E out T and F2GEN has too many, especially at low values of x vis . Also shown in Figures 9, 10 and 11 is E for /E tot , the observed energy in the forward region divided by the total observed energy. Of the four Monte Carlo models, PHO-JET 1.05 shows the best agreement with the data in the hadronic final state, HER-WIG 5.9+k t (dyn) describes the data better than HERWIG 5.9, and F2GEN gives the worst description of the data. For the final results, only PHOJET 1.05 and HERWIG 5.9+k t (dyn) are used; however, all four Monte Carlo models are used in studies of the unfolding methods.
None of the Monte Carlo programs discussed above contain radiative corrections to the deep inelastic scattering process. These are dominated by initial state radiation from the deeply inelastically scattered electron. The final state radiation is experimentally integrated out due to the finite granularity of the calorimeters. The Compton scattering process contributes very little, and the radiative corrections due to radiation of photons from the electron that produced the quasi-real target photon were shown to be very small [35].
In this analysis the radiative corrections have been evaluated using the RADEG program [36], which includes initial state radiation and the Compton scattering process. The calculations are performed using mixed variables, which means that Q 2 is calculated from electron variables, while W 2 is calculated from hadronic variables. The value of x is found from W 2 and Q 2 , exactly as for the experimental analysis. The RADEG program allows calculation of differential cross-sections in bins of x and Q 2 , with or without radiative corrections, while applying the experimental restrictions on the electron energy and polar angle, the minimum invariant mass and the anti-tag angle. The predicted ratio of the differential cross-section for each bin of the analysis is used to correct the data, i.e. the measured F γ 2 is multiplied by the ratio of the non-radiative and the radiative cross-sections. The radiative corrections reduce the cross-section in the phase space of the present analysis. They are largest at low values of x. The size of the largest correction to any bin is 14%. The radiative corrections are insensitive to the choice of F γ 2 in the calculation, for example, the difference between the predicted corrections for the GRV and the QPM structure functions is negligible in all regions of x and Q 2 .
Several theoretical ansatzes exist for how F γ 2 should behave as a function of P 2 [37][38][39]. They all predict a decrease of F γ 2 (x, Q 2 , P 2 = 0) with P 2 which is strongest at low values of x. This means that applying a correction to obtain an F γ 2 which is valid at P 2 = 0 would change the shape of the measured F γ 2 . The size of the effect was studied based on the GRS [37] and SaS1D [38] parameterisations using three values of P 2 : 0.01, 0.02 and 0.03 GeV 2 , and four values of Q 2 : 1.9, 3.7, 9.5 and 17.6 GeV 2 . These values reflect the range of P 2 values expected in the data, and the Q 2 values of the data samples. The largest suppression observed at x = 0.003 for P 2 = 0.01/0.02/0.03 GeV 2 is around 7/12/17%; however, at low x the predictions differ by more than a factor of two. Because the distribution of P 2 in the data is not known and the theoretical predictions differ significantly, no correction is applied.

Unfolding
Unfolding is a statistical technique which is used in this analysis to correct the measured distributions for detector effects. The unfolding problem can be described as follows: A measurement g j is made of a true distribution f i , with j and i bins respectively. The Events in a certain bin of the true distribution can be spread into several bins in the measured distribution. In γ ⋆ γ events this can be due to missing energy in the forward region (FD and SW plus the beampipe), as well as the intrinsic energy resolution of the subdetectors.
• Limited acceptance: Some events fail the cuts and therefore contribute only to the true x distribution, and not to the measured distribution.
• Background: There are also events originating from other reactions which only contribute to the measured distribution.
Resolution and acceptance effects can be described in terms of a response matrix A ij . The distributions g j and f i are then related by where b j is the background distribution. Equation 6 cannot simply be inverted to find f i because this does not lead to a stable solution 6 . Instead, after b j is subtracted, f i is estimated from a procedure which includes a smoothing (or regularisation) of the distribution to reduce statistical fluctuations. The response matrix A ij is found from Monte Carlo simulation. After unfolding, F γ 2 is determined by reweighting the input structure function of the Monte Carlo according to the ratio of the unfolded x distribution to the x distribution in the Monte Carlo: where F γ 2,MC is the input structure function of the Monte Carlo sample used for unfolding, N MC,i is the number of events in the Monte Carlo sample in the ith bin and f (x) is the unfolded x distribution, obtained from a quadratic spline fit to f i , with f (x) = 0 at the edges of the acceptance region. F γ 2,i represents the measured value of the average F γ 2 of the events in bin i of the unfolded distribution, and not the average over x. The distinction is important mainly at low x, where f (x) changes most rapidly.
The main unfolding program used for this analysis was GURU [40]; the programs RUN [41] and BAYES [42] were used to check the unfolding. In previous OPAL F γ 2 analyses, only RUN was used. The programs GURU and RUN are similar in principle, but have different implementations. Due to its use of standard matrix techniques, GURU has a much simpler structure and is easier to modify than RUN, which uses a maximum likelihood fit with spline functions. The BAYES program performs unfolding using an iterative method based on Bayes' theorem, leaving regularisation to the user (e.g. a polynomial fit after each iteration except the last). The programs GURU, RUN and BAYES were compared by unfolding test distributions including experimental data. The result of one such study, unfolding the LEP1 SW data with HERWIG 5.9+k t (dyn), using the measured distribution x vis , is shown in Figure 12a. The results from the three programs are generally consistent, though the BAYES program tends to assign smaller errors than the other two.
The amount of regularisation in the unfolding is set by the number of degrees of freedom (NDF). It can be seen in Figure 12b that increasing this number leads to larger unfolding errors. Reducing the number of degrees of freedom increases the correlations between the bins. The number of degrees of freedom to be used is determined by the GURU program from statistical analysis of the data distributions, as described in the GURU documentation [40].
The unfolding of the variable x is done on a logarithmic scale in order to study the low-x region in more detail. Several improvements to the unfolding procedure used in previous OPAL analyses have been implemented. These are described in the following sections.

Reconstruction of W
The visible hadronic mass squared is defined by where i runs over all tracks and clusters in the hadronic final state, with energy E i and momentum vector (p i,x , p i,y , p i,z ). Because hadronic showers are not well contained in the electromagnetic calorimeters, and some energy is lost in the beampipe, the measured energy in the forward region is less than the true energy. It is possible to improve the reconstruction of W vis by including kinematic information from the tagged electron [43]. W 2 vis can also be written as where p i± = E i ± p i,z and p i,t = p i,x 2 + p i,y 2 . Using conservation of energy and momentum, and assuming that the untagged electron travels along the beam direction, the sums over p i+ and p i,t can be replaced by electron variables, leading to where p beam+ and p tag+ are calculated for the tagged electron before and after scattering, respectively, and p tag,t is calculated according to the definition of p i,t , above, for the tagged electron. When using Equation 10 instead of Equation 9 to evaluate the measured hadronic mass of each event, the hadronic energy enters only through the p i− term. This is an advantage because the leptonic energy resolution is usually better than the hadronic energy resolution. The (reconstructed) variable formed in this way is called W rec . The corresponding measurement of x is called x rec . The improvement in the resolution when using W rec instead of W vis depends on how well the hadronic system is measured, and is therefore both detector dependent and model dependent.
Even after the above technique has been applied, the value of W rec is still generally smaller than the true value. This is mainly due to energy losses in the forward calorimeters, in which only about 40% of the hadronic energy is observed. This makes the measurement very dependent on the angular distribution of the hadrons in the final state. In an attempt to make the energy response of the detector more uniform and to reduce the systematic error due to the uncertainty in the Monte Carlo modelling, a new (corrected) variable is formed: W cor , along with the corresponding x measurement x cor , in which the contribution from the forward region has been scaled by a factor of 2.5. This factor was obtained by comparing the generated and measured energy in the forward region in Monte Carlo events. The uncertainty of this factor has been taken into account in the evaluation of the systematic errors. Figure 13a shows the correlation between the generated W and the three measured quantities W vis , W rec and W cor for the LEP2 SW sample generated with HERWIG 5.9+k t (dyn). The spread of W cor in W is larger than that of the other two variables, because in W cor more weight is given to the forward region, where the energy resolution is worse than in the central region. In general, W cor is still lower than W , mainly because of energy lost in the beampipe. Figure 13b shows the correlation between the generated energy in the forward region and the scaled observed energy in that region, E cor .
The three measured variables x vis , x rec and x cor can be used to unfold the true variable x. The difference between the results is model dependent and generally small when using Monte Carlo models that already give a good description of the hadronic final state (Figure 12c).

Two dimensional unfolding
The GURU program can also be used to perform unfolding in two dimensions. As with the attempts to improve the W reconstruction described above, the motivation is to reduce the dependence of the unfolding on a particular Monte Carlo model. There is information in every event about the angular distribution of energy in the detector, but it is not fully exploited if only x is used in the unfolding procedure. Including another variable in a second unfolding dimension makes more direct use of this information. Two variables were considered as possible second unfolding variables. They were chosen because they are very sensitive to the angular distribution of the hadrons in the final state: • E out T /E tot , the transverse hadronic energy out of the plane containing the beam line and the tagged electron (see Equation 5), divided by the total observed energy.
• E for /E tot , the observed energy in the forward calorimeters divided by the total observed energy.
These variables are shown in Figures 9-11. To test the two dimensional unfolding procedure, a random number was used as the second variable. The results are shown in Figure 12d. They are consistent with one dimensional unfolding, which is as expected since there is no extra information in a random number.

Testing the unfolding methods
To find which of the unfolding methods was the most reliable, OPAL data was unfolded with four Monte Carlo programs HERWIG 5.9, HERWIG 5.9+k t (dyn) PHO-JET 1.05 and F2GEN. Despite not giving good descriptions of the data, HERWIG 5.9 and F2GEN were included in this study to investigate the effectiveness of the new techniques using extreme models. The best methods are considered to be those which give the smallest difference between the unfolded results with the four Monte Carlo models. The quantity compared is χ 2 , defined by where F γ 2 i is the value of the unfolded result in the ith bin, F γ 2 i is the average of the results from all four Monte Carlo models and σ i is the statistical error for F γ 2 i . The values of χ 2 are shown in Table 4 and the results of the unfolding are shown in Figures 14-16. The χ 2 values show how large the differences between the models are, compared to the statistical error. In general, the lowest values of χ 2 were obtained using two-dimensional unfolding with x cor as the first variable and E out T /E tot as the second variable, which consequently is used as the standard unfolding method for the results. The χ 2 values are usually smaller for the LEP1 FD sample than the other two. This is partly due to the smaller number of bins. Additionally, the lower statistics in the LEP1 FD sample mean that any difference between the Monte Carlo programs would be less significant.
With both two dimensional unfolding and with one dimensional unfolding using x cor as the unfolding variable, the spread between the results with the different models is reduced compared to one dimensional unfolding results using x vis , and the agreement between F2GEN and the other models is especially improved. Using different unfolding methods with the HERWIG 5.9+k t (dyn) and PHOJET 1.05 samples makes less difference than with the other two Monte Carlo samples, which is as expected for models which give a better description of the data.
As a final test of the unfolding method, several samples of Monte Carlo events generated using HERWIG 5.9 with the SaS1D structure function were unfolded using HERWIG 5.9 with the GRV LO structure function. The results are shown in Figure 17. The measured structure function agrees with the input structure function within the statistical precision of the Monte Carlo events generated using the SaS1D structure function.
The effect of taking into account the shape of f (x) within each bin of x, as in Equation 7, compared to simply extracting F γ 2 as the average input structure function of the Monte Carlo sample times the ratio of the number of unfolded events and the number of Monte Carlo events in each bin, was also investigated with these samples. The effect is largest in the lowest x bins for LEP2 SW, where the decrease in the value of F γ 2 when taking into account the shape within the bin is a little less than the statistical errors shown in Figure 17.
x vis x cor x rec x cor x rec x cor 1.9 LEP1 SW 66. 8

Results
The photon structure function F γ 2 is measured by unfolding each data sample in bins of log(x). Each OPAL data sample is divided into two ranges of Q 2 containing approximately equal numbers of events. The ranges correspond to Q 2 values of 1.9 and 3.7 GeV 2 for the LEP1 SW sample, and 8.9 (10.7) and 17.5 (17.8) GeV 2 for the LEP1 FD (LEP2 SW) sample, where the average Q 2 values have been obtained from the data. F γ 2 is unfolded to a given Q 2 value and does not correspond to an average in each bin of Q 2 .
The results are listed in Table 6. The quoted values were measured as the average F γ 2 /α in each bin of x weighted by the unfolded x distribution, according to Equation 7, then corrected to the log centre of each bin, except for the highest x bins where the log centre of that portion of the bin below the charm threshold for m c = 1.5 GeV was used. The bin-centre corrections are the average of the GRV LO and SaS1D predictions for the correction from the average F γ 2 over the bin to the value of F γ 2 at the nominal x position.
The results are also corrected for radiative effects. The radiative corrections were calculated using the RADEG [36] program and are listed in Table 7, along with the bin-centre corrections. The statistical correlations between bins are shown in Table 8.
The central value of F γ 2 in each x bin is the average of the data unfolded with HERWIG 5.9+k t (dyn) and PHOJET, using two dimensional unfolding with x cor as the first variable and E out T /E tot as the second variable. Standard HERWIG 5.9 and F2GEN were not used as they are not in acceptable agreement with the data. The systematic errors are evaluated by repeating the unfolding with one parameter varied at a time and finding the shift in the result. The systematic errors are combined by adding all of the individual contributions in quadrature. The summing is done separately for positive and negative errors. The systematic effects considered are listed below.
• Monte Carlo modelling: The quoted result is the average of the results obtained using HERWIG 5.9+k t (dyn) and PHOJET 1.05. The errors are symmetrical and equal to half the difference between the results with the two Monte Carlo programs.
• Unfolding method: The unfolding was repeated with E for /E tot as the second variable, instead of E out T /E tot .
• Unfolding parameters: The number of bins used for the measured variable can be different from the number of bins used for the true variable (though it should be at least as large).
The standard result has 6 bins in the measured variable. This was increased to 8 to estimate the systematic effects of unfolding.
• W reconstruction: The weighting applied to the energy in the forward calorimeters was varied between 2.0 and 3.0. This check allows for uncertainty in the treatment of forward energy.
• Variations of cuts: The composition of the selected events was varied by changing the cuts one at a time. The size of the variations reflect the resolution of the measured variables and are sufficiently small not to change the average Q 2 of the sample significantly, or remove too many events from any single x bin. The variations of the cuts are listed in Table 5.
• Off-momentum electrons: There is a possible contamination of off-momentum background in one region of azimuthal angle φ in the 183 GeV data, so as a precaution this φ region was removed in the measurement of F γ 2 , and the difference from the result with the φ region left in was included as a systematic error.
• Calibration of the tagging detectors (FD and SW): The energy of the tagged electron in the Monte Carlo samples was scaled by ±1%, to allow for uncertainty in the simulation of the detectors. The size of the variation was motivated by a comparison of the E tag /E b distributions in data and Monte Carlo.
• Measurement of the hadronic energy: The main uncertainty is in the calibration of the electromagnetic calorimeters (excluding the forward region which is dealt with separately). This was varied by ±3% in the Monte Carlo samples. The track quality criteria were also varied, but the effects were negligible.
• Simulation of background: The hadronic background events at LEP1 stem from the production of photons or light mesons with a large fraction of the beam energy. The mesons fake an electron tag due to their decays into photons. The cross-section for these events was measured by OPAL with an accuracy of about 50%, and found to be consistent with the JETSET prediction [44]. The normalisation of the simulated background was therefore varied by ±50%.
• Bin-centre correction: The corrections depends on the shape of F γ 2 in each bin. The average of the corrections based on GRV LO and SaS 1D was used, as these parameterisations are the closest to the data. The error is half the difference between the GRV LO and SaS 1D corrections, and is symmetric.
Because the estimation of each source of systematic error has some statistical fluctuation due to the changing distributions (e.g. when a cut is varied), the quadratic sum of all individual sources is an overestimate of the total systematic error. The expected statistical component of each source of systematic error that changes the data sample is determined by unfolding 8 Monte Carlo samples, each about the same size as the data sample.
The corrected systematic error on the ith bin from source k is then where ∆f i,k is the shift on the ith bin in the data when changing a parameter k, and σ ∆f i,k is the expected statistical component of the shift, which is approximated by the statistical spread in the systematic error estimates for source k in the 8 Monte Carlo samples. In a few cases this was larger than the statistical error in the data. In these cases σ ∆f i,k was set to the statistical error in the data, in order not to hide possibly significant systematic errors. This procedure means that for the individual systematic errors either the expected statistical component is subtracted, or the error is set to zero if the observed shift in the data is consistent with a statistical fluctuation as predicted by the 8 Monte Carlo samples.
The total systematic error is the quadratic sum of all the individual contributions ∆ ′ f i,k from the above sources. The sum is made separately for deviations above and below the standard result. The systematic errors from each source as a function of x and Q 2 are listed in Table 9.
The results are shown along with previous OPAL measurements of F γ 2 [16] in Figures 18 and 19. The overlapping results from the LEP1 FD and LEP2 SW samples are in good agreement. The measurements of F γ 2 using LEP1 data for Q 2 = 1.9 GeV 2 and Q 2 = 3.8 GeV 2 are lower than the previous OPAL LEP1 results, which were unfolded using HERWIG 5.8d. Repeating the unfolding with HERWIG 5.8d gives results which are consistent with the old analysis, but with better precision. The HERWIG 5.8d Monte Carlo model has now been replaced by HERWIG 5.9+k t (dyn), which gives a better description of the data. The results in the four higher Q 2 bins are consistent with previous OPAL measurements, and have smaller errors. The previous LEP1 results with electrons tagged in SW or FD are superseded by the present analysis.
In Figures 20 and 21 the results are compared to measurements of F γ 2 from other experiments: TPC/2γ [13], PLUTO [10], TOPAZ [15], ALEPH [19], DELPHI [17] and L3 [18]. Also shown in Figures 20 and 21 are the GRV LO, SaS1D and WHIT1 [8] 7 parameterisations of F γ 2 , and the naive quark-parton model (QPM). The QPM prediction, which only models the point-like component of F γ 2 , is calculated for four active flavours with masses of 0.2 GeV for light quarks and 1.5 GeV for charm quarks.
The previous results are found to be generally consistent with the new OPAL measurements. The largest differences are from the older measurement by TPC/2γ at Q 2 = 2.8 GeV 2 , which suggests a different shape of F γ 2 to all the other measurements. The L3 results obtained at Q 2 = 5.0 GeV 2 are consistently higher than the OPAL measurements. However, because the L3 points are highly correlated, due to the finer binning in x, the discrepancy looks stronger than it actually is.  Due to large spread in the theoretical predictions for the P 2 suppression of F γ 2 (as discussed in Section 4) the OPAL measurements are not corrected for this effect. Since the data contain a P 2 suppression, which is not included in the Monte Carlo simulation, applying the correction would lead to an x dependent increase of the measured F γ 2 . This should be taken into account when comparing to the parameterisations of F γ 2 , which are all shown for P 2 = 0. In general the shape of the GRV LO parameterisation is consistent with the OPAL data in all the accessible x and Q 2 regions. The normalisation is also consistent with the data, except at the lowest scale, Q 2 = 1.9 GeV 2 , where GRV is too low. The SaS1D LO prediction shows a slower evolution with Q 2 than the GRV prediction. At low Q 2 it is slightly above GRV LO, whereas at the largest Q 2 values shown, it falls below GRV LO. Within the precision of the OPAL measurement, the description of the data by SaS1D LO is of similar quality to GRV LO. In the region of applicability, the WHIT1 prediction is higher than the OPAL data and flatter than the other predictions, though the shape is still consistent with the data.
The hadron-like component is predicted to dominate at low values of x. For x < 0.1 the naive quark-parton model is not able to describe the data, indicating that the photon must contain a significant hadron-like component at low x.

Conclusions
The photon structure function F γ 2 has been measured using deep inelastic electronphoton scattering events recorded by the OPAL detector during the years 1993-1995, 1997 and 1998, at e + e − centre-of-mass energies of 91 GeV (LEP1), and 183-189 GeV (LEP2). F γ 2 has been measured as a function of x to the lowest attainable x values, in six ranges of Q 2 (including two overlapping pairs) corresponding to average Q 2 values of 1.9, 3.7 GeV 2 for LEP1 SW, and 8.9 (10.7), 17.5 (17.8) GeV 2 for LEP1 FD (LEP2 SW). In previous OPAL studies of the photon structure function, it became clear that a large source of uncertainty in the measurement came from the Monte Carlo modelling of the hadronic final state of deep inelastic electron-photon scattering events. Since then, improved Monte Carlo models have become available. In a comparison of the energy flows and E out T distributions, which are very sensitive to the modelling of the hadronic final state, these new models, HERWIG 5.9+k t (dyn) and PHOJET 1.05, give better descriptions of OPAL data than the HERWIG 5.9 and F2GEN programs. Consequently the latter two programs have not been used for the F γ 2 measurement. Previous OPAL measurements of F γ 2 using LEP1 data with electrons tagged in SW or FD are superseded by this analysis.
To further reduce the Monte Carlo modelling error, two dimensional unfolding has been introduced, using E out T /E tot as a second unfolding variable. Also, the reconstruction of the invariant mass of the hadronic final state has been improved by including information from the deeply inelastically scattered electron, and by scaling the energy observed in the forward calorimeters to partially compensate for energy losses. Monte Carlo modelling of the final state is still a significant source of systematic error, but it no longer dominates all other sources. The total systematic errors are of comparable size to the statistical errors.
Although the precision of the measurement at low x has been considerably improved it is still insufficient to determine whether or not there is a rise in F γ 2 in that region.
The GRV LO and SaS1D parameterisations are generally consistent with the OPAL data in all the accessible x and Q 2 regions, with the exception of the measurement at the lowest scale, Q 2 = 1.9 GeV 2 , where GRV is too low. In contrast, the naive quarkparton model is not able to describe the data for x < 0.1. These results show that the photon must contain a significant hadron-like component at low x.   Table 7: Corrections to the result as a function of x in bins of Q 2 , as a percentage of the non-corrected F γ 2 . The radiative corrections were predicted by RADEG [36]. The bin-centre corrections are the average of the GRV LO and SaS1D predictions for the correction from the average F γ 2 over the bin to the value of F γ 2 at the nominal x position. The x positions are at the log centre of the bins, except for the highest x bins, where they are at the the log centre of that portion of the bin below the charm threshold for m c = 1.5 GeV.   Table 6.    Table 9: Systematic errors for F γ 2 /α as a function of x in bins of Q 2 . The statistical error and the contributions to the systematic error from each of the sources are listed as percentages of F γ 2 /α . The systematic errors are corrected for the expected statistical component as described in the text. They are summed in quadrature separately for positive and negative deviations, except for 'MC modelling' and 'bin centre' which are symmetric.   Figure 2: Comparison of data distributions with Monte Carlo predictions for the LEP1 SW sample. The dominant background source, γ ⋆ γ → τ + τ − , the total background and the sum of the signal and the total background for HERWIG 5.9+k t (dyn) and PHOJET 1.05 are shown. The Monte Carlo samples have been normalised to the data luminosity. All selection cuts have been applied, except for any cut on the variable in the plot. The cuts are shown as dotted lines. The distributions shown are: a) E tag /E b , the energy of the tagged electron as a fraction of the beam energy, b) θ tag , the polar angle of the tagged electron, c) the measured Q 2 and d) E a /E b , the energy of the most energetic electromagnetic cluster in the hemisphere opposite the tagged electron, as a fraction of the beam energy. The errors are statistical only.   Figure 3: Comparison of data distributions with Monte Carlo predictions for the LEP1 FD sample. The dominant background sources, γ ⋆ γ → τ + τ − and Z 0 → hadrons, the total background and the sum of the signal and the total background for HERWIG 5.9+k t (dyn) and PHOJET 1.05 are shown. The Monte Carlo samples have been normalised to the data luminosity. All selection cuts have been applied, except for any cut on the variable in the plot. The cuts are shown as dotted lines. The variables in the plots are as defined in Figure 2. The errors are statistical only.   Figure 4: Comparison of data distributions with Monte Carlo predictions for the LEP2 SW sample. The dominant background source, γ ⋆ γ → τ + τ − , the total background and the sum of the signal and the total background for HERWIG 5.9+k t (dyn) and PHOJET 1.05 are shown. The Monte Carlo samples have been normalised to the data luminosity. All selection cuts have been applied, except for any cut on the variable in the plot. The cuts are shown as dotted lines. The variables in the plots are as defined in Figure 2. The errors are statistical only.      Figure 6: Comparison of data distributions with Monte Carlo predictions for the LEP1 FD sample. The dominant background sources, γ ⋆ γ → τ + τ − , Z 0 → hadrons and γ ⋆ γ ⋆ → hadrons (with P 2 >1.0 GeV 2 ), the total background and the sum of the signal and the total background for HERWIG 5.9+k t (dyn) and PHOJET 1.05 are shown. For illustration, the background from γ ⋆ γ → µ + µ − is also shown, though it is only significant below the N ch cut. The Monte Carlo samples have been normalised to the data luminosity. All selection cuts have been applied, except for any cut on the variable in the plot. The cuts are shown as dotted lines. The variables in the plots are as defined in Figure 5. The errors are statistical only.   Figure 7: Comparison of data distributions with Monte Carlo predictions for the LEP2 SW sample. The dominant background sources, γ ⋆ γ → τ + τ − , Z 0 → hadrons and γ ⋆ γ ⋆ → hadrons (with P 2 >4.5 GeV 2 ), the total background and the sum of the signal and the total background for HERWIG 5.9+k t (dyn) and PHOJET 1.05 are shown. For illustration, the background from γ ⋆ γ → µ + µ − is also shown, though it is only significant below the N ch cut. The Monte Carlo samples have been normalised to the data luminosity. All selection cuts have been applied, except for any cut on the variable in the plot. The cuts are shown as dotted lines. The variables in the plots are as defined in Figure 5. The errors are statistical only.

1/N dE/dη
[GeV]           Figure 13: (a) The correlation between the generated and measured invariant mass, W and W measured , for the HERWIG 5.9+k t (dyn) LEP2 SW Monte Carlo sample, using, W vis , W rec and W cor . (b) The corrected energy, E cor , (2.5 times the visible energy) in the forward region against the total generated energy, E, deposited in that region, for HERWIG 5.9+k t (dyn) and PHOJET 1.05. The vertical error bars represent the RMS spread within each bin. The dotted lines represent perfect correlation.    Figure 17: Two independent samples of Monte Carlo events generated using HERWIG 5.9 with the SaS1D structure function unfolded with HERWIG 5.9 with the GRV LO structure function, in six Q 2 ranges. The solid histogram shows the SaS1D structure function at the same average Q 2 as the sample in each plot, weighted by the x distribution in the HERWIG SaS1D sample (this is the quantity measured by Equation 7). The dotted lines show the GRV LO structure function weighted by the unfolded x distribution. The unfolded SaS1D samples were about the same size as the data samples in each Q 2 region. The errors are statistical only.  Figure 18: The measurement of F γ 2 /α using the LEP1 SW sample, for Q 2 values of (a) 1.9 and (b) 3.7 GeV 2 . Also shown are the previous OPAL results in these Q 2 ranges, which were unfolded using HERWIG 5.8d, and the result of unfolding the LEP1 SW data using HERWIG 5.8d. For each point, the inner error bars show the statistical error and the full error bars show the total error, except for the new result with HERWIG 5.8d, for which only statistical errors are shown. The positions of the new OPAL points are as given in Table 6. The other points are shown at the centre of the log(x) bin. The curves show the GRV LO structure function.  Figure 19: The measurement of F γ 2 /α using the LEP1 FD and LEP2 SW samples for Q 2 values of (a) 8.9 (10.7) and (b) 17.5 (17.8) GeV 2 for LEP1 (LEP2). Also shown are the previous OPAL results in these Q 2 ranges, which were unfolded with HERWIG 5.8d ( Q 2 =7.5 GeV 2 and Q 2 =14.7 GeV 2 ) and HERWIG 5.9 ( Q 2 =9.0 GeV 2 and Q 2 =14.5 GeV 2 ) using a linear x scale. For each point, the inner error bars show the statistical error and the full error bars show the total error. The positions of the new OPAL points are as given in Table 6. The other points with closed symbols are shown at the centre of the log(x) bin, and those with open symbols are shown at the average x value of the bin. The curves show the GRV LO structure function.  Figure 20: The measurement of F γ 2 /α using the LEP1 SW sample, for Q 2 values of (a) 1.9 and (b) 3.7 GeV 2 . Also shown are the results from L3 [18] PLUTO [10], and TPC/2γ [13]. For L3 the two sets of points were unfolded using different Monte Carlo programs. The lower / upper points correspond to PHOJET 1.05 / TWOGAM. For each point, the inner error bars show the statistical error and the full error bars show the total error. The positions of the new OPAL points are as given in Table 6. The other points with closed symbols are shown at the centre of the log(x) bin, and those with open symbols are shown at the average x value of the bin. The curves show the GRV LO, SaS1D, WHIT1 and QPM structure functions.  Figure 21: The measurement of F γ 2 /α using the LEP1 FD and LEP2 SW samples for Q 2 values of (a) 8.9 (10.7) and (b) 17.5 (17.8) GeV 2 for LEP1 (LEP2). Also shown is a selection of results from other experiments: ALEPH [19], DELPHI [17], L3 [18], PLUTO [10], and TOPAZ [15]. For each point, the inner error bars show the statistical error and the full error bars show the total error. The positions of the new OPAL points are as given in Table 6. The other points with closed symbols are shown at the centre of the log(x) bin, and those with open symbols are shown at the average x value of the bin. The curves show the GRV LO, SaS1D, WHIT1 and QPM structure functions.