sample N个数中随机选M个

void RandomSelectMofN( int * a, int N, int M) {
sample N个数中随机选M个

// pls refer to Knuth's TAOCP
sample N个数中随机选M个

int n;

srand(unsigned(time(NULL)));
sample N个数中随机选M个

for(n=N;n>0;n--){
sample N个数中随机选M个

if( rand()%n < M ){ // assume bigrand.
sample N个数中随机选M个

cout<<N-n<<endl;
sample N个数中随机选M个

m--;

}

let's prove this algorithm:

for convenience ,change to for(int i=0;i<n;i++){}

now, we have randomly selected t elements in i elements, then we want to selected the next one(named x).

there are C(n-i, m-t) kinds of choice to randomly select the last m-t elements.

and, there are C(n-i-1, m-t-1) kinds of choice to select the last m-t elements which contains the ith element(x).

so C(n-i, m-t) / C(n-i-1, m-t-1) = m-t / n-i, which is the probability of selecting ith element in our code.

----------------------------------------------

former wrong proof:

each element should be selected with a probability of m/n.

now, we have randomly selected t elements in i elements, then we want to selected the next one(named x).

Because there are N-i left, and we have to selected M-t more,

so the probability to selected x should be (M-t)/(N-i).

秒客网