Planetary detection limits taking into account stellar noise
Key Words.:
stars: individual: Cen B – stars:planetary systems – stars: activity – Sun: activity – Sun: sunspots – techniques: radialvelocitiesAbstract
Context:The detection of small mass planets with the radialvelocity technique is now confronted with the interference of stellar noise. HARPS can now reach a precision below the meterpersecond, which corresponds to the amplitudes of different stellar perturbations, such as oscillation, granulation, and activity.
Aims:Solar spot groups induced by activity produce a radialvelocity noise of a few meterpersecond. The aim of this paper is to simulate this activity and calculate detection limits according to different observational strategies.
Methods:Based on Sun observations, we reproduce the evolution of spot groups on the surface of a rotating star. We then calculate the radialvelocity effect induced by these spot groups as a function of time. Taking into account oscillation, granulation, activity, and a HARPS instrumental error of 80 cm s, we simulate the effect of different observational strategies in order to efficiently reduce all sources of noise.
Results:Applying three measurements per night of 10 minutes every three days, 10 nights a month seems the best tested strategy. Depending on the level of activity considered, from to , this strategy would allow us to find planets of 2.5 to 3.5 M in the habitable zone of a K1V dwarf. Using Bern’s model of planetary formation, we estimate that for the same range of activity level, 15 to 35 % of the planets between 1 and 5 M and with a period between 100 and 200 days should be found with HARPS. A comparison between the performance of HARPS and ESPRESSO is also emphasized by our simulations. Using the same optimized strategy, ESPRESSO could find 1.3 M planets in the habitable zone of K dwarfs. In addition, 80 % of planets with mass between 1 and 5 M and with a period between 100 and 200 days could be detected.
Conclusions:
1 Introduction
We have now discovered more than 400 exoplanets with the radialvelocity (RV) technique
Oscillations of solar type stars, which can be seen as a dilatation and contraction of external envelopes over timescales of a few minutes (5 minutes for the Sun; Schrijver & Zwann 2000; Broomhall et al. 2009), is caused by pressure waves (pmodes) that propagate at the surface of solar type stars. The individual amplitudes of pmodes are typically from a few to tens of cm s, but the interference of tens of modes with close frequencies introduce RV variations of several cm s, depending on the star’s spectral type and evolutionary stage (Bedding & Kjeldsen 2007; Bouchy & Carrier 2003; Bedding & Kjeldsen 2003; Schrijver & Zwann 2000). The amplitude and period of the oscillation modes increase with mass along the main sequence. Theory and observations show that the frequencies of the pmodes rise with the square root of the mean density of the star and that their amplitudes are proportional to the ratio of the luminosity over the mass (ChristensenDalsgaard 2004; O’Toole et al. 2008).
The convection in external layers of solar type stars drives different phenomena of granulation (granulation, mesogranulation, and supergranulation), which also affect the RV measurements on time scales going from several minutes to several hours. In order of timescale and size of convective pattern, we first have granulation, whose typical timescale is shorter than 25 minutes (Title et al. 1989; Del Moro 2004). Then comes mesogranulation and finally supergranulation, with timescales of up to 33 hours (Del Moro et al. 2004). When integrated over the entire stellar disk, all these convection perturbations present a noise level on the order of the meterpersecond.
On longer timescales, similar to the star rotational period, the presence of activity related spots and plages perturbs precise RV measurements. Spots and plages on the surface of a star will break the flux balance between the redshifted and the blueshifted halves of the star. As the star rotates, a spot group, or a plage, moves across the stellar disk and produces an apparent Doppler shift (Saar & Donahue 1997; Queloz et al. 2001; Huélamo et al. 2008; Lagrange et al. 2010). This effect can be hard to distinguish from the signal caused by the presence of a planet. For the Sun at maximum activity of cycle 23, Meunier et al. (2010) find a noise related to spot groups and plages of 42 cm s. Because the temperature of spot groups and plages are different from the mean stellar surface, and because active regions contain both, the noise induced by them will usually be compensated, but not entirely, because the surface ratio between spots and plages varies (e.g. Chapman et al. 2001). According to Meunier et al. (2010), the major perturbative effect of activity on RVs is not the one induced directly by spot groups and plages, but the one caused by the inhibition of convection in active regions (e.g. Dravins 1982; Livingston 1982; Brandt & Solanki 1990; Gray 1992). This effect leads to a blueshift distortion of spectrum lines, which results in a noise varying from 40 cm s at minimum activity to 140 cm s at maximum.
The Earth produces a radialvelocity perturbation of 9 cm s on the Sun, which would be completely masked by the stellar noise. If we manage to understand the structure of these different kinds of noise, we could use appropriate observational strategies to reduce the stellar noise contribution as much as possible, and thus find very small mass planets far from their host star. This investigation of optimized observational strategies is essential for future and more accurate instruments, such as ESPRESSO@VLT (http://espresso.astro.up.pt/, precision expected: 10 cm s) or CODEX@EELT (e.g. Pasquini et al. 2008, precision expected: 2 cm s), in order to detect Earth twins (Earthmass planets in habitable regions).
The two first types of noise, which are caused by oscillations and granulation phenomena, have been discussed in a previous paper (Dumusque et al. 2011, hereafter Paper I). Starting from HARPS asteroseismology measurements, we characterized these two kinds of noise using the power spectrum representation. Then we derived an optimized observational strategy, reducing at best the stellar noise contribution while keeping a reasonable total observational time.
In the present paper, we continue this first study and include the noise coming from magnetic activity spot groups. Starting from observations of the Sun, which is the only star where we can resolve spot groups, we simulate the appearance of spot groups on its surface and calculate the RV contribution. Adding the noise coming from magnetic activity to the results obtained in Paper I, we develop an observational strategy that simultaneously reduces the three kinds of stellar noise. To finish, we calculate the corresponding detection limits, as well as the expected number of planets that could be found by comparing our results with Bern’s model of planetary formation.
2 Simulation of different activity levels
For decades surveys have studied the activity level of FGK stars in the solar neighborhood (e.g. Baliunas et al. 1995; Hall et al. 2007). One of the main results of these surveys is the bimodality of the activity level distribution (e.g. Vaughan & Preston 1980; Henry et al. 1996). About 70% of the stars, even those with a clear Sunlike activity cycle, seem to have an activity level below (see Noyes et al. 1984, for more information about the chromospheric emission ratio, ). The others have a higher activity level, and no crossing between the lowactivity and highactivity region seems to occur. Early results of the Kepler mission arrive at the conclusion that there may be more active stars than expected (see Basri et al. 2010). Of more than 100000 stars with a surface gravity, , higher than 4 and an effective temperature between 3200 and 19000 K, 50 % seem to be more active than the active Sun. Nevertheless, very lowactivity stars have been discovered in this sample.
In the present paper, we simulate the activity of stars below . The goal is to reproduce as realistically as possible the appearance of stellar spot groups on the surface of these stars. Because the Sun is the only star with an activity level between and for which we can resolve stellar spot groups, we will use it as a proxy for lowactivity stars. Stars with a higher than 4.75 will not be simulated. First of all because of a lack of information about the activity phenomenon of these stars, and secondly, because the activity noise will probably be too important to find very small mass planets in habitable regions with the RV technique.
2.1 Activity of the Sun
The Sun has a 22year magnetic cycle as well as a 11year sunspot cycle. During these 11 years, the activity level varies between at minimum and at maximum. At minimum there is no spot group on the surface of the Sun. When the activity level rises, several spot groups start to appear at a latitude of about . Before reaching the maximum activity level, the number of spot groups will increase progressively, whereas the latitude of spot groups will decrease ( at maximum activity). After this maximum phase, the migration of spot groups toward the equator continues and the number of groups decreases progressively until they all disappear. At the end of a cycle, the activity level returns to , and another sunspot cycle of 11year begins again.
Because the latitude of the spot groups changes during the sunspot cycle, it is important to recall that the Sun has a differential rotation in latitude. Therefore, spot groups rotate faster at the equator than at the poles, according to the equation
(1) 
where is the angular speed in , the latitude, and A and B are two constants equal to and (see Howard 1996).
During the 11year sunspot cycle or an even longer time, the spot groups do not appear randomly in longitude, but 15 around preferred active longitudes (e.g. Ivanov 2007; Berdyugina & Usoskin 2003). These preferred longitudes turn with the Sun’s differential rotation and are observed for large spot groups

there are normally two active longitudes in the northern hemisphere and two in the southern one,

the two active longitudes are shifted by 180 in each hemisphere,

the active longitudes in the south are shifted by 90 in regard to those in the north,

sometimes, each hemisphere can have a third active longitude, which is shifted by 90 with regard to the two normal ones.
When sunspots appear, their diameter is very small, approximately 0.003. The majority of them will disappear in a couple of hours, but a few will evolve and become bigger spots, which will gather together as spot groups. These can last for several days to a few months. According to Howard & Murdin (2000), the lifetime distribution of spot groups is

50% of the spot groups have a lifetime between 1 hour and 2 days,

40% of the spot groups have a lifetime between 2 days and 11 days,

10% of the spot groups have a lifetime between 11 days and 2 months.

For the Sun at maximum activity (), there is one group spot per sunspot cycle that can last from 2 to 6 months.
During its lifetime, a spot group will rapidly increase in area until the maximum size is reached, after which the decrease will follow in a slower way (e.g Howard & Murdin 2000). The increase and decrease times are not precisely defined. For our simulation, we will set these times to 1/3 and 2/3 of the total lifetime of a spot group.
The maximum spot group filling factor
(2) 
where is expressed in hemisphere and in days.
Once the lifetime and the maximal size of spot groups are defined, we need a probabilistic process to mimic the observed appearance of spot groups on the solar surface. According to Hoyt & Schatten (1998), the appearance of spot groups can be described by a Poisson law. We can therefore define the probability, , to have a certain number of spot groups, , by the formula
(3) 
where is the time, the time step and the average appearance of spot groups per unit of time.
The upper panel in Fig. 1 represents the continuous daily observation of sunspots (Marshall Space Flight Centre, http://solarscience.msfc.nasa.gov/SunspotCycle.shtml). Clearly, the number of spots varies with the 11year sunspot cycle of the Sun. Comparing the two graphs in Fig. 1, we notice that the number of spots is correlated with the Sindex and thus with the (because S and are proportional, see Noyes et al. (1984)). In conclusion, no spot corresponds to , and the maximum number of spots corresponds to . According to this argument, we decide to simulate three different levels of activity:

: the Sun is at minimum activity and there is no spot group on the solar surface. Therefore, no noise from the spot groups perturbs the RVs (case of Paper I).

: the Sun is at maximum activity. The number of spots in the visible hemisphere is approximately equal to 150 and they are located at of latitude (e.g Howard & Murdin 2000).

: the Sun is at medium activity, which corresponds to a total number of 75 spots at of latitude in the visible hemisphere.
The activity level is given by a defined number of spots, but the properties defined above, such as the lifetime distribution, the maximum filling factor (Eq. 2), and the appearance statistical law (Eq. 3), are given for spot groups. In order to use all these properties, we need to know the number of spot groups of a defined number of spots. This can be done using the work of Hathaway & Choudhary (2008), which gives the number of sunspots per spot group as a function of the spot group maximum filling factor (see Fig. 2).
2.2 Activity simulation
In order to check our ability to detect Earthlike planets through activity effects, our simulations will try to mimic the Sun activity phenomena as much as possible.
For the simulation, we start with the following initial conditions:

We set the number of spots, , for a given activity level.

This maximum filling factor gives us, using Fig. 2, the number of spots present in this spot group, . If , we create a second spot group and so on, until .

Each spot group is put on the latitude corresponding to the activity level and on the different active longitudes, with no possible overlap. The longitudes of the spot groups are selected using a Gaussian distribution
^{4} centered on active longitudes and with a dispersion of .
Once the initial conditions are created, we start the time evolution of the system, using a rotational period of the Sun at the equator of 26 days. The time step of the simulation is set to two hours. All spot groups begin with a null area and increase their size linearly until they reach the maximum after 1/3 of their lifetime (see Eq. 2). The decreasing phase is linearly during the remaining 2/3 of their lifetime. The number of new spot groups at each time step is given by the appearance statistical law (see Eq. 3).
At the beginning of the simulation, the value for the average appearance of spot groups, , is unknown. To find the right value, we begin the simulation with an arbitrary number and calculate the total filling factor for each time step. Using Fig. 2, this filling factor give us the total number of spots. If the mean total number of spots in not equal to in several years, we restart the simulation with another value for . Finally, after several tests, is set to 2.4 and 4.8 for an activity level of and , respectively.
For each level of activity, we generate 100 simulations with an equal probability of having two or three active longitudes per hemisphere.
2.3 Radialvelocities induced by activityrelated spot groups
When spot groups are present on a rotating star, they break the flux balance between the redshifted and the blueshifted halves of the star. As the star rotates, a spot group moves across the stellar disk and produces apparent Doppler shifts. This modulation is often hard (sometimes impossible) to distinguish from the Doppler modulation caused by the gravitational pull of a planet. To model the radialvelocity variations imposed by stellar spot groups, we used the program SOAP (Bonfils & Santos, in prep.). SOAP computes the rotational broadening by sampling the stellar disk on a grid. Each grid cell is assigned a Gaussian function that represents the typical emerging spectral line (or, equivalently, the spectrum’s Cross Correlated Function (CCF)). All cells are Dopplershifted according to their projected velocities toward the observer’s lineofsight, and averaged with a weight following a linear limbdarkening law (). When spot groups are added to the stellar surface, SOAP computes which grid cells they occupy and change the weight for those given cells. The weight can be set to zero for a dark spot group, to a fraction of the stellar brightness if we assume a given brilliance for the spot group, or to more than the stellar brightness to simulate plages. Finally, for a given spot group configuration, SOAP delivers the modified spectral line, the flux, the radialvelocity, and the bisector span as a function of the star’s rotating phase. Note that SOAP merely calculates the flux effect induced by spot groups. It does not model any type of granulation, therefore, the inhibition of convection (granulation) in active regions such as spot groups is not took into account (Meunier et al. 2010).
Normally a spot group is constituted of several small spots. For the simulation we considered each spot group as a unique big spot to facilitates the operation. We used the program SOAP to calculate the RV effect induced by a spot group presenting a filling factor, of 0.001 and a latitude of 15 (Fig. 3) or 22.5. The first latitude corresponds to a of 4.75 and the second to a of 4.9. To get the RV effect for any spot group filling factor, , we first calculate the RV semiamplitude with SOAP, , which is induced by different spot group sizes. Then we fitted a power law model to these data (Fig. 4) and found that

for a spot group at a latitude of 15, ,

for a spot group at a latitude of 22.5, .
These results are consistent with the relation presented in Saar & Donahue (1997), where the authors have found that . We will use this relation in our simulation for all latitudes. Supposing that besides the amplitude, the RV signal has the same shape for any spot group size, we calculate the proportionality coefficient between two signals, which is . Multiplying the RV effect induced by a spot group with by this coefficient yields the RV effect induced by any spot group size.
The simulation does not consider spot groups with a lifetime shorter than one day, because the maximum filling factors will be very small an therefore the RV contribution negligible.
We present here a simulation for the Sun, which is seen equator on, therefore the inclination angle between the line of sight and the rotation axis of the star, , equals 90. For other stars, this assumption cannot be made because for the majority of them we do not have access to the angle. Moreover, when decreases, the RV which is equal to decreases as well . Thus the RV effect of spot groups crossing the visible stellar disc, which is proportional to , will be lower. In conclusion, considering in the simulation gives us the maximum RV effect that spot groups can induce.
2.4 Comparison with other works
An example of simulations for =4.9 and 4.75 can be seen in Fig. 5. The amplitudes found can be compared to a recent result from Meunier et al. (2010), who calculated the RV induced by the real distribution of spot groups during the solar sunspot cycle 23. For maximum activity, they find a RV variation of 48 cm s. Note that this variation is valid only for spot groups and was derived without taking into account plages or inhibition of the convection in active regions. In our case, after 100 simulations of a highactivity level, we arrive at a value of 63 cm s. Our estimate is 30 % higher that the one found by Meunier et al. (2010), which is based on real position and size of spot groups. This can be explained because Meunier et al. (2010) used observations of the solar cycle 23. This was a fairly quiet cycle, presenting at maximum no more than 100 sunspots in the visible hemisphere (see Fig. 1). In our case, the maximum number of sunspots was based on cycles 21 and 22, which were more intense (150 sunspots in the visible hemisphere at maximum activity level). In Fig. 7 we show the calculated RV variation obtained for different values of the total spot number. A quadratic fit to the data points indicates a RV variation of 51 cm s for 100 spots, which is very close to the value cited in Meunier et al. (2010). Thus, our simulation, starting from very simple characteristics of spot groups such as the lifetime distribution, the spot number according to the activity level, and the Poisson process of spot group appearance, manages very well to represent the RV effect induced by spot groups. Note that the inhibition of convection in active regions is not yet included in our simulation, and according to Meunier et al. (2010), this effect could be dominant. Therefore, the RV effect of the activity is not yet fully simulated and could be underestimated. The very good agreement comparing the spot flux effect between real Sun observations and the simulation indicates that our simple model of spot groups is realistic. Including the inhibition of convection in the simulation is in progress and will be the topic of a forthcoming paper. Our activity simulation presents an advantage compared with the work done by Meunier et al. (2010) in the sense that we can adapt it to other stars than the Sun, if basic characteristics of the spot groups are known.
According to Queloz et al. (2009) and Boisse et al. (2010), the RV signal caused by stellar spots is found at the rotational period of the star, , and its two first harmonics. When considering few big spots at the surface of the star that can change in size but not disappear rapidly, this assumption is valid because the activity signal stays in phase with rotation. In the two papers cited above, the authors consider active stars that are known to be dominated by few big spots that can live for several rotational periods. In our simulation, were we model nonactive stars, we consider a lot of spots that are allowed to disappear rapidly and reappear at other places any time. This random behavior, ruled by a Poisson law, kills the activity signal phase even on timescales similar to the rotational period of the star. The RV signal however keeps a kind of periodicity on a rotational period timescale, but the period can be different form and its two first harmonics. In Fig. 6 we plot the periodograms and the RVs for 31 day slices () of the simulation showed in Fig. 5 with an activity level of . Looking at the periodograms, we see that sometimes important peaks are present at , and (plot on the right), as suspected by Queloz et al. (2009) and Boisse et al. (2010). At other times, the peak at is clear but nothing shows up at and (plot on the left) or at the rotational period and the two first harmonics (plot on the center). Looking now at the RV plots, we see that the model of three sine waves with period , and can fit the simulated RVs nicely. The comparison between the rms of the simulated RVs and the corrected one (simulated RVs minus fitted sine waves) shows that the three sine waves model reduces the activity noise by 70 %. Therefore this model can be used to correct shortterm activity noise. However, if a small mass planet orbits the stars with a period close to , the three sine waves model will kill this signal too. We have therefore to be very careful when using these models, and only the study of other CCF parameters such as the full width at half maximum (FWHM) or the bissector span (BIS) will be able to give us clues on the nature of the signal: shortterm activity or planet. This work will be done once the inhibition of the convective blueshift in active regions will be implemented into our model. In addition, the estimate of the rotational period using the three sine waves model, as shown in Queloz et al. (2009), is no longer valid. Indeed, we see that the fitted period, , can vary between 22 to 34 days, whereas the true rotational period of the star is fixed to 26.4 days in the simulation.
3 Efficient observational strategies
The RV signal induced by a spot group can be described as the succession of null contribution when the spot group is in the hidden hemisphere, and as sinusoidal shapes when the spot group moves across the visible hemisphere (Fig. 3). This RV signal is regular in time and is composed of sinusoidal signals with periods equal to the rotational period of the star and its first harmonics (Boisse et al. 2010). If there are several spot groups, these RV signals will add or cancel themselves, leading to a quasiperiodic signal with unpredictable but limited amplitude. Because the period of the RV signal varies between approximately 10 to 30 days, the idea is to average out these effects by an adequate sampling of the variation time scale.
In Paper I, a good observational strategy to average out oscillation and granulation noise was to take three 10minute measurements per night with two hours of spacing. This strategy was applied to the actual calendar of HD69830, which is one of the most followed stars by HARPS. This calendar, besides observation gaps owing to bad weather, is very similar to 10 consecutive nights of observation every month. This idealistic strategy of three measurements per night of 10 minutes each, on 10 consecutive nights every month, will be referred to below as the 3N1 strategy. Because the typical time scales of activity noises are longer than 10 days (see Fig. 5), the 3N1 strategy is not optimized to average out this kind of perturbation. In order to average out all three kinds of noise at the same time, we have to cover a period of a month with measurements, while keeping the benefit of the higher frequency sampling. Taking into account this last argument, we simulate the effect of two new observational strategies. Keeping the three measurements per night of 10 minutes and the 10 nights of observation per month, we change the separation between each observational night. The new considered observational strategy will measure the star every second night (hereafter 3N2 strategy) and every third night (hereafter 3N3 strategy). For the 3N1, 3N2, and 3N3 strategies the calendar of HD69830 was not suitable any more and we had to create new calendars, asking for the following criteria:

The total observational time span is four years.

The star disappears from the sky during four months each year.

Each month of observation, the beginning of the measurements is randomly set.
In addition, we randomly suppress 20 % of nights to take into account bad weather or technical problems. These calendars represent a total of 256 nights of measurements over a time span of four years.
In Fig. 8 (left panel) we can compare these different observational strategies. The RV variation, induced by spot groups for a maximum level of activity (), reaches a level of approximately 60 cm s. By using the 3N3 strategy, we manage to reduce this level of RV variation to 10 cm s when averaging the effect in 10 day bins. For the same binning, the 3N1 strategy reduces this level to 30 cm s.
We can now calculate as a function of the binning in days, including all types of noise: oscillation, granulation, activity, and instrumental (80 cm s). The result for can be seen in Fig. 8 (right panel).
We notice that the 3N3 strategy averages out much more activity noise after binning than the 3N1 strategy. Without activity () the two strategies are identical. The RV variations reachable using 10 days bins (suitable for long period planets) with the best considered strategy are 22 cm s, 25 cm s, and 28 cm s for a equal to 5, 4.9 and 4.75, respectively. If there is high activity, the improvement brought by this optimal strategy reaches 30 % compared to the 3N1 strategy.
In conclusion, sampling the complete rotational period of the star with the same number of measurements, but with more separation between them, averages out the activity noise to a considerable extent. Thus, the strategy needs to be adjusted to the stellar rotational period. Here for the Sun, which has a 26 day rotational period, the 3N3 strategy is the optimal one with 10 measurements per month. For a star with a 20 day rotational period, the 3N2 strategy would average out the stellar noise better. In addition, increasing the gap between observational nights by more than three days for slow rotators is not recommended, because it will induce a poor sampling of short period planets.
4 Detection limits in the massperiod diagram
In order to derive detection limits in terms of planet mass and period accessible with the different studied strategies, we calculate the falsealarm probability (FAP) of detection using bootstrap randomization (Endl et al. 2001; Efron & Tibshirani 1998). We first simulate a synthetic RV set containing stellar, instrumental, and photon noises as we did in the previous sections. The calendars used depend on the selected strategy, but all have a total of 256 observational nights over a time span of four years (see Sect. 3). Then we carry out 1000 bootstrap randomizations of these RVs and calculate the corresponding periodograms. For each periodogram, we select the highest peak and construct a distribution of these 1000 highest peaks. The 1 % FAP corresponds to the power, which is only reached 1 % of the time. The second step consists in adding a sinusoidal signal with a given period to the synthetic RVs. We calculate the periodogram of these new RVs and compare the height of the observed peak with the 1 % FAP. We then adjust the semiamplitude of the signal until the power of the peak is equal to the 1 % FAP. The obtained semiamplitude corresponds to the detection limit of a null eccentricity planet with a confidence level of 99 %. To be conservative, we test 10 different phases and select the highest semiamplitude value.
In practice, we simulate 100 RV sets for each activity level (=5, 4.9 and 4.75) and strategy, and to be conservative, only the 10 ”worst” cases per strategy were considered. It is important to note that for each strategy we simulate the RVs according to the respective calendar (10 measurements per months on consecutive nights or every second or third night, eight months a year over four years) and we removed randomly 20 % of the nights to simulate bad weather or technical problems. The observational calendars are therefore realistic. Figure 9 presents for different levels of activity the 99 % confidence level detection limits expected for the 3N1 and the 3N3 strategies using HARPS.
The thick line in top part of Fig. 9 corresponds to the detection limit for a maximumactivity level () using the present HARPS observational strategy (1N1 strategy = one measurement of 15 minutes on 10 consecutive days per month, see Paper I for details). Obviously, there is a great improvement brought by the 3N1 and 3N3 strategies compared to what is done at present on HARPS.
We can also compare the level of the detection limits obtained for the 3N1 and 3N3 strategies. For example, at a 100 day period, the improvement brought by the 3N3 strategy is 4 %, 10 % and 17 %, for an activity level of , and , respectively. As expected, the improvement increases with a rising activity level because the 3N3 strategy is optimized to reduce activity noise. For minimum activity, which corresponds to no spot groups and consequently no noise from activity in our simulation, the detection limits obtained with the two strategies should be identical. The 4 % difference can be explained by a more regular sampling (over all months with the 3N3 strategy and only 10 consecutive days per month for the 3N1). In conclusion, the 3N3 strategy seems to be an efficient strategy to average out all kinds of noise. For example, for a period of 200 days, which corresponds to the habitable region around a K1 dwarf such as , the detection limits for the 3N3 strategy ranges between 2.5 and 3.5 depending on the activity level. Following these results, it seems clear that even including the activity noise related to spot groups, planets smaller than 5 in habitable regions could be detected with HARPS using an appropriate observational strategy.
Future instruments, like ESPRESSO at the VLT, will reach a precision level of 10 cm s. The thick line at the very bottom of Fig. 9 shows the improvement when we change the instrumental noise from 80 to 10 cm s. Evidently the improvement is significant, reducing approximately the mass detection limit by 1 . We note, however, that because we used HARPS data as a basis for our simulations, it is possible that we overestimate the noise of the ESPRESSOlike radialvelocities. These latter values must then be seen as upper limits.
5 Planet detection according to Bern’s model
The detection limits give us an idea of the smallest planet that could be found with an appropriate strategy. Nevertheless, this does not inform us on the proportion of small mass planets that could be detected. This requires a model of planetary formation that can predict the ”true” population of small planets. We used Bern’s model (e.g. Mordasini et al. 2009a, b), which is a good proxy of planet population. In their simulation they solve as in classical core accretion models the internal structure equation for the forming giant planet, but include at the same time disk evolution and type I and II planetary migration.
Using the detection limits derived in Sect. 4, we calculate the number of planets predicted by Bern’s model that are above a given detectable level. The proportion of expected planets between and that could be found with the 3N3 strategy on HARPS is represented in the left panel of Fig. 10. In this panel, we clearly see that the proportion of detection decreases when the activity level increases. For the period range between 100 and 200 days, the proportion of planets with that would be found with HARPS changes from 35 % to 15 % when the varies from to . Although the activity greatly reduces the number of detectable planets, it is still possible to find some in the habitable region of K dwarfs (200 days) for the highest case of activity ( for lowactivity stars, see Sect. 2).
We note for a similar activity level that the value of the bin for 300 to 400 days is lower than the one for 400 to 500 days. Because the mass detection limits increase with period, we expect the opposite. This effect is owing to the observational calendar used. Indeed, because stars in the sky can generally be followed during eight months per year before they disappear (an effect taken into account in our observational calendar), the one year period is not well sampled, which complicates the detection. This is well illustrated in Fig. 9, were a peak near one year appears for each detection limit.
A comparison with what could be found using ESPRESSO is shown in the right panel of Fig. 10. If the activity level is set to , ESPRESSO could find 80 % of the planets in a mass range from 1 to 5 and a period between 100 and 200 days, whereas HARPS could only find 35 % of them.
6 Detection of a simulated planet
To show that the calculated detection limits are realistic, we present in this section an example of planet detection. For , we first create some synthetic RVs including all types of noise, oscillations, granulation phenomena, and activity. The strategy used is the 3N3 and the activity level is set to . Secondly, we add the RV signal induced by a small planet in the habitable zone (, a 208 day period). To check whether the planet is detected or not, we calculate the periodogram and the 1 % and 0.1 % FAP, using bootstrap randomization (see Sect. 4).
In Fig. 11, top panels, we show the raw RVs, stellar noise plus planet, for the 3N3 strategy as well as the added planet. The periodogram with the 1 % FAP for each period is also represented. The peak at 200 days is far above the 0.1 % FAP, which proves that the planet is easily detected. To compare this with the actual measurements made on HARPS for the highprecision program, we apply the same process for the 1N1 strategy (1N strategy in Paper I). This strategy consists of measuring the star 15 minutes per night on 10 consecutive days per month. The bottom panels of Fig. 11 show the results. Evidently here the planet is far from being detectable.
Looking at Fig. 9, we see that the mass detection limit for 208 days of period and for an activity level is more than 3 for the 3N3 strategy. In this example, we see that using the same strategy and configuration, a 2.5 earth mass planet could easily be detected. Thus the detection limits derived in Sec. 4 are very conservative. This is because we consider only the 10 ”worst” cases when deriving the detection limits.
7 Concluding remarks
In Paper I we had proposed an efficient observational strategy to reduce the stellar noise generated by oscillation and granulation as much as possible. This strategy, requiring three measurements per night of 10 minutes over 10 consecutive days each month, improves the averaging out of stellar noise coming from oscillation and granulation by 30 %, with an observational cost only multiplied by a factor 2. In the present paper, we add the noise induced by activityrelated stellar spot groups.
To study the radialvelocity (RV) effect caused by stellar spot groups, we consider three different activity levels. Based on Sun observations, we simulate the minimum solar activity level (), the maximum one (), and an intermediate one (). Comparing the RV variation at maximum activity given by our simulation with the one calculated by real position and size of spot groups from cycle 23 (Meunier et al. 2010), we find a very good agreement, 51 cm s and 48 cm s, respectively. We note that the RV effect of activity is not fully simulated because the inhibition of convection in active regions is not yet implemented. According to Meunier et al. (2010), this effect could be important and consequently, the detection limits calculated here could be underestimated. Compared to the results of Meunier et al. (2010), our simulation is more general in the sense that we can predict the RV effect of activity related spots groups for other stars, knowing a few characteristics (mean spot number, distribution of spot groups lifetime, presence of active longitudes, differential rotation). This will be very important when a better knowledge of activity phenomena of other stars than the Sun will be known. The Kepler mission should give us some clues in the near future.
Modeling the shortterm activity with a three sine waves function with period , and can greatly reduce the spotinduced noise by approximately 70 %. However, we have to be very careful with this method because can vary in a non negligible way from the rotational period of the star. Thus, signal of small mass planets with period similar to the rotational period of the star will be killed by this type of model. Only a study of other CCF parameters such as the FWHM or the BIS can give us clues on the true nature of the signal: shortterm activity or planet.
After simulating the effect of several observational strategies, the most efficient one to average out all kinds of noise is the 3N3. This strategy consists in measuring the star three times 10 minutes per night, with a spacing of two hours. Then the measurements are taken every third night, 10 days every month. With this strategy it would be possible to find planets of 2.5 to 3.5 Earth mass with HARPS in the habitable region of K dwarfs (200 days). The first mass value corresponds to a case without activity () and the second one to a maximum activity level (). Even if the activity caused by spot groups introduces a non negligible noise, small mass planets in habitable regions could be detected with HARPS with an appropriate observational strategy.
Using the population of low mass planets predicted by Bern’s model, we calculate the proportion of planets that could be found with the 3N3 strategy. Using HARPS, it would be possible to find 35 % of the planets below with a period between 100 and 200 days and a lowactivity level (). This value decreases to 15 % for a equal to . If we trust this model of formation, which is the most compatible with observations, HARPS could discover several planets below in the habitable region of earlyK dwarfs.
In our simulation, the 3N3 strategy appears to be the best one to average out activity noise. This is because of the rotational period of the Sun, which is fixed in our simulation at 26 days. Indeed, when choosing 10 observational nights a month, the best way to sample the entire rotational period is to observe the star every third night. For shorter rotational periods, increasing the measurement frequency would lead to a better averaging. For longer rotational periods, reducing the measurement frequency is not recommended because short period planets will be poorly sampled.
Taking three measurements of 10 minutes per night gives us a very low photon noise when binning the data over the night. Thus, the photonnoise is not a limitation any longer, and 2 to 4meters class telescopes can still be used to go down to the 10 cm s level on bright stars. However, spectrographs must also intrinsically reach this level of precision and for the moment, only ESPRESSO@VLT (http://espresso.astro.up.pt/) is designed to reach this goal. With such a precision, improvements would be remarkable. The detection limits would go down in mass by a level of 1 , reaching 1.3 in the habitable region of earlyK dwarfs. In addition, ESPRESSO would find 80 % of the planets between 1 and 5 and with a period between 100 and 200 days, which is more than twice as much as what HARPS could find. Moreover, CODEX@EELT (e.g Pasquini et al. 2008), would improve the long term instrumental precision down to a few cm s level and would detect planets in habitable regions.
Acknowledgements.
X. Dumusque would like to acknowledge Y. Alibert, J. Hall and W. Lockwood for private communication. We acknowledge the support by the European Research Council/European Community under the FP7 through Starting Grant agreement number 239953. NCS also acknowledges the support from Fundação para a Ciência e a Tecnologia (FCT) through program Ciência 2007 funded by FCT/MCTES (Portugal) and POPH/FSE (EC), and in the form of grants reference PTDC/CTEAST/098528/2008 and PTDC/CTEAST/098604/2008. SGS is supported by grant SFRH/BPD/47611/2008 from FCT/MCTES.. Finally, this work was supported by the European Helio and Asteroseismology Network (HELAS), a major international collaboration funded by the European Commission’s Sixth Framework Program (grant : FP62004Infrastructures5026183).Footnotes
 see The Extrasolar Planets Encyclopaedia, http://exoplanet.eu
 Large spot groups represent 11% of the total number of spot groups observed, corresponding to 67% of the total spot group area.
 The filling factor is just the ratio of the area of the spot group over the area of the visible hemisphere.
 Because 68 % of the values generated by a Gaussian distribution will fall in the range , where is the mean and the dispersion, this technique allows us to put 68% of the total spot groups area at around the active longitudes (67 % observed, see Sect. 2.1).
References
 Baliunas, S. L., Donahue, R. A., Soon, W. H., et al. 1995, apj, 438, 269
 Basri, G., Walkowicz, L. M., Batalha, N., et al. 2010, ApJ, 713, L155
 Bedding, T. R. & Kjeldsen, H. 2003, PASA, 20, 203
 Bedding, T. R. & Kjeldsen, H. 2007, Communications in Asteroseismology, 150, 106
 Berdyugina, S. V. & Usoskin, I. G. 2003, aap, 405, 1121
 Boisse, I., Bouchy, F., Hebrard, G., et al. 2010, ArXiv eprints
 Bouchy, F. & Carrier, F. 2003, Ap&SS, 284, 21
 Brandt, P. N. & Solanki, S. K. 1990, A&A, 231, 221
 Broomhall, A., Chaplin, W. J., Davies, G. R., et al. 2009, mnras, 396, L100
 Chapman, G. A., Cookson, A. M., Dobias, J. J., & Walton, S. R. 2001, apj, 555, 462
 ChristensenDalsgaard, J. 2004, solphys, 220, 137
 de Jager, C. & Nieuwenhuijzen, H. 1987, aap, 177, 217
 Del Moro, D. 2004, A&A, 428, 1007
 Del Moro, D., Berrilli, F., Duvall, Jr., T. L., & Kosovichev, A. G. 2004, solphys, 221, 23
 Dravins, D. 1982, ARA&A, 20, 61
 Dumusque, X., Udry, S., Lovis, C., Santos, N. C., & Monteiro, M. J. P. F. G. 2011, A&A, 525, A140+
 Efron, B. & Tibshirani, R. J. 1998, An Introduction to the Bootstrap (Chapman & Hall/CRC)
 Endl, M., Kürster, M., Els, S., Hatzes, A. P., & Cochran, W. D. 2001, A&A, 374, 675
 Gray, D. F. 1992, The Observation and Analysis of Stellar Photospheres (The Observation and Analysis of Stellar Photospheres, by David F. Gray, pp. 470. ISBN 0521408687. Cambridge, UK: Cambridge University Press, June 1992.)
 Hall, J. C., Lockwood, G. W., & Skiff, B. A. 2007, aj, 133, 862
 Hathaway, D. H. & Choudhary, D. P. 2008, solphys, 250, 269
 Henry, T. J., Soderblom, D. R., Donahue, R. A., & Baliunas, S. L. 1996, AJ, 111, 439
 Howard, R. & Murdin, P. 2000, Sunspot Evolution
 Howard, R. F. 1996, araa, 34, 75
 Hoyt, D. V. & Schatten, K. H. 1998, Sol. Phys., 179, 189
 Huélamo, N., Figueira, P., Bonfils, X., et al. 2008, aap, 489, L9
 Ivanov, E. V. 2007, Advances in Space Research, 40, 959
 Lagrange, A., Desort, M., & Meunier, N. 2010, ArXiv eprints
 Livingston, W. C. 1982, Nature, 297, 208
 Lockwood, G. W., Skiff, B. A., Henry, G. W., et al. 2007, apjs, 171, 260
 Mayor, M., Bonfils, X., Forveille, T., et al. 2009a, A&A, 507, 487
 Mayor, M., Udry, S., Lovis, C., et al. 2009b, aap, 493, 639
 Meunier, N., Desort, M., & Lagrange, A. 2010, A&A, 512, A39+
 Mordasini, C., Alibert, Y., & Benz, W. 2009a, A&A, 501, 1139
 Mordasini, C., Alibert, Y., Benz, W., & Naef, D. 2009b, A&A, 501, 1161
 Noyes, R. W., Hartmann, L. W., Baliunas, S. L., Duncan, D. K., & Vaughan, A. H. 1984, ApJ, 279, 763
 O’Toole, S. J., Tinney, C. G., & Jones, H. R. A. 2008, MNRAS, 386, 516
 Pasquini, L., Avila, G., Dekker, H., et al. 2008, in Society of PhotoOptical Instrumentation Engineers (SPIE) Conference Series, Vol. 7014, Society of PhotoOptical Instrumentation Engineers (SPIE) Conference Series
 Pepe, F., Mayor, M., Queloz, D., et al. 2005, The Messenger, 120, 22
 Queloz, D., Bouchy, F., Moutou, C., et al. 2009, A&A, 506, 303
 Queloz, D., Henry, G. W., Sivan, J. P., et al. 2001, A&A, 379, 279
 Saar, S. H. & Donahue, R. A. 1997, ApJ, 485, 319
 Santos, N. C., Bouchy, F., Mayor, M., et al. 2004, A&A, 426, L19
 Schrijver, C. J. & Zwann, C. 2000, Solar and stellar magnetic activity (Cambridge U Press, 2000)
 Selsis, F., Chazelas, B., Bordé, P., et al. 2007, Icarus, 191, 453
 Title, A. M., Tarbell, T. D., Topka, K. P., et al. 1989, ApJ, 336, 475
 Udry, S., Bonfils, X., Delfosse, X., et al. 2007, A&A, 469, L43
 Vaughan, A. H. & Preston, G. W. 1980, pasp, 92, 385
 Wright, J. T., Marcy, G. W., Butler, R. P., et al. 2008, ApJ, 683, L63