Should we use bibliometric indices to evaluate research? Denis - - PowerPoint PPT Presentation

should we use bibliometric indices to evaluate research
SMART_READER_LITE
LIVE PREVIEW

Should we use bibliometric indices to evaluate research? Denis - - PowerPoint PPT Presentation

Should we use bibliometric indices to evaluate research? Denis Bouyssou CNRSLAMSADE LINA February (based on joint work with Thierry Marchant, Ghent University, Belgium) If you do not know Thierry. . . Outline 1 Bibliometrics 2


slide-1
SLIDE 1

Should we use bibliometric indices to evaluate research?

Denis Bouyssou CNRS–LAMSADE LINA February  (based on joint work with Thierry Marchant, Ghent University, Belgium)

slide-2
SLIDE 2

If you do not know Thierry. . .

slide-3
SLIDE 3

Outline

1 Bibliometrics 2 Model & Results 3 Discussion

slide-4
SLIDE 4

Outline

1 Bibliometrics 2 Model & Results 3 Discussion

slide-5
SLIDE 5

Bibliometrics Context

Academia

General context globalization knowledge economy financial and economic crisis

4

slide-6
SLIDE 6

Bibliometrics Context

Academia

General context globalization knowledge economy financial and economic crisis Impacts on academia budget cuts arrival of new players (China, India) increased mobility of staff & students proliferation of evaluation & funding agencies proliferation of indices & rankings industrialization of academia

4

slide-7
SLIDE 7

Bibliometrics Context

Industrialization of academia

Symptoms AERES + LRU + ANR + fusions of Universities + teaching in English + LESR students’ demonstrations (Printemps ´ erable & UK) + students’ debt crisis fraud & plagiarism increase evaluation fever

bibliometric indices everywhere

5

slide-8
SLIDE 8

Bibliometrics Bibliometrics

Bibliometrics

Two extreme positions bibliometrics is an absolute evil bibliometrics brings objectivity and fairness

⑧ ⑧ ⑧

6

slide-9
SLIDE 9

Bibliometrics Bibliometrics

Bibliometrics

Two extreme positions bibliometrics is an absolute evil bibliometrics brings objectivity and fairness Both positions are plainly wrong!

⑧ ⑧ ⑧

6

slide-10
SLIDE 10

Bibliometrics Bibliometrics

Bibliometrics

Bibliometrics defined using mathematical and statistical techniques to study publishing and communication patterns

7

slide-11
SLIDE 11

Bibliometrics Bibliometrics

Bibliometrics

Bibliometrics defined using mathematical and statistical techniques to study publishing and communication patterns The field of Bibliometrics active scientific field

journals: Scientometrics, Journal of Informetrics, Journal of the American Society for Information Science and Technology, Research Policy, . . . ISSI: International Society for Scientometrics and Informetrics regular International Conferences

7

slide-12
SLIDE 12

Bibliometrics Bibliometrics

Bibliometrics

Some research questions bibliometric laws: Lotka, Bradford social network of {scientists, papers, fields} efficiency of research policy of a country factors influencing transfer of knowledge towards industry which journals should libraries subscribe to? impact of open access on diffusion on knowledge strong and weak research fields of a country emerging fields

8

slide-13
SLIDE 13

Journal of Economic Literature 2008 IF (3.65) (frequency of number of citations in 2008 to paper published in 2006–2007)

slide-14
SLIDE 14

Bart summarizes

slide-15
SLIDE 15

Map of 800 terms co-occurrencing in abstracts of OR journals (VOSviewer)

slide-16
SLIDE 16

Map of ISI fields (VOSviewer)

Molecular & Cell Biology

Medicine Physics

Ecology & Evolution Economics Geosciences Psychology Chemistry Psychiatry Environmental Chemistry & Microbiology Mathematics

Computer Science Analytic Chemistry Business & Marketing Political Science Fluid Mechanics Medical Imaging Material Engineering Sociology Probability & Statistics Astronomy & Astrophysics Gastroenterology Law Chemical Engineering Education Telecommunication Control Theory Operations Research Ophthalmology Crop Science Geography Anthropology Computer Imaging Agriculture Parasitology Dentistry Dermatology Urology Rheumatology Applied Acoustics Pharmacology Pathology

Otolaryngology Electromagnetic Engineering Circuits Power Systems Tribology

Neuroscience

Orthopedics Veterinary Environmental Health

A

Citation flow from B to A Citation flow within field Citation flow from A to B Citation flow out of field

B

slide-17
SLIDE 17

Bibliometrics Evaluative bibliometrics

Evaluative bibliometrics and bibliometric indices

Evaluative bibliometrics publications in journals are the central research output citations to publications are important signs of recognition the more publication & citations you have the better “bibliometrically limited view of a complex reality” (A. van Raan, 2005)

13

slide-18
SLIDE 18

Bibliometrics Evaluative bibliometrics

Evaluative bibliometrics and bibliometric indices

Evaluative bibliometrics publications in journals are the central research output citations to publications are important signs of recognition the more publication & citations you have the better “bibliometrically limited view of a complex reality” (A. van Raan, 2005) count publications & citations summarize these counts by indices

13

slide-19
SLIDE 19

Bibliometrics Evaluative bibliometrics

Evaluative bibliometrics and bibliometric indices

Databases Web of Science (ISI, Thomson Reuters) Scopus (Elsevier) Google Scholar (PoP + Google) Record publications and citations Online uses during evaluation committees by often uninformed users

14

slide-20
SLIDE 20

DB: 456 papers, 3464 citations, h-index = 27

slide-21
SLIDE 21

DB: 42 papers, 415 citations, h-index = 12

slide-22
SLIDE 22

DB: 2929 citations, h-index = 27

slide-23
SLIDE 23

DB: 42 papers, 390 citations, h-index = 9

slide-24
SLIDE 24

Bibliometrics Warnings

A few words of warning

Databases cleaning is needed and not easy to do!

spelling errors + incorrect citations names: diacritical signs, T EX ligatures, transliteration, homonyms (Martel in Qu´ ebec, Kim or Park in Korea) correct affiliations are extremely difficult to determine counting: original articles, letters, notes, erratum, obituaries, reviews, editorials lost citations (up to 30%)

important differences between fields

publication intensity citation intensity & behavior longevity of papers (months vs decades)

19

slide-25
SLIDE 25

Citation intensity for the 21 ISI categories

slide-26
SLIDE 26

Bibliometrics Warnings

A few more words of warning

Science is not immune to social effects peer review has documented defects (tests / retests) motives for citation are diverse (negative citations, perfunctory citations) self citations and network effects manipulation of the JIF by editors Humbolt & Merton vs Bourdieu

21

slide-27
SLIDE 27

Bibliometrics Warnings

A few more words of warning

Science is not immune to social effects peer review has documented defects (tests / retests) motives for citation are diverse (negative citations, perfunctory citations) self citations and network effects manipulation of the JIF by editors Humbolt & Merton vs Bourdieu Nightmares how to deal with multiple authors (sometimes more than 1 000) how to deal with multiple affiliations what is an author? (ghost authors, unequal contributions, . . . ) people react and adapt quickly: perverse effects are pervasive epistemology: normal science vs paradigm shifts (Kuhn)

21

slide-28
SLIDE 28

Examples of papers with many authors

slide-29
SLIDE 29

Bibliometrics Bibliometric indices

Bibliometric indices

Hypotheses all above problems have been taken care of you have a good, verified, and cleaned database Many possible indices counting of papers counting of citations sum of Impact Factors Markovian indices (PageRank) h-index and its variants

23

slide-30
SLIDE 30

Bibliometrics Bibliometric indices

Bibliometric indices

Hypotheses all above problems have been taken care of you have a good, verified, and cleaned database Many possible indices counting of papers counting of citations sum of Impact Factors Markovian indices (PageRank) h-index and its variants Bibliometric Indices what properties? how to compare them? how to combine them?

23

slide-31
SLIDE 31

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (1/2)

Evaluation of authors h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each) author f: 4 papers with 4 citations each author g: 3 papers with 6 citations each

ih(f) = 4 > ih(g) = 3

24

slide-32
SLIDE 32

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (1/2)

Evaluation of authors h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each) author f: 4 papers with 4 citations each author g: 3 papers with 6 citations each

ih(f) = 4 > ih(g) = 3 both authors publish a new paper with 6 citations ih(f ∗) = 4 = ih(g∗) = 4

24

slide-33
SLIDE 33

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (1/2)

Evaluation of authors h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each) author f: 4 papers with 4 citations each author g: 3 papers with 6 citations each

ih(f) = 4 > ih(g) = 3 both authors publish a new paper with 6 citations ih(f ∗) = 4 = ih(g∗) = 4 both authors publish a new paper with 6 citations ih(f ∗∗) = 4 < ih(g∗∗) = 5

24

slide-34
SLIDE 34

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (1/2)

Evaluation of authors h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each) author f: 4 papers with 4 citations each author g: 3 papers with 6 citations each

ih(f) = 4 > ih(g) = 3 both authors publish a new paper with 6 citations ih(f ∗) = 4 = ih(g∗) = 4 both authors publish a new paper with 6 citations ih(f ∗∗) = 4 < ih(g∗∗) = 5 Conclusion Independence is violated

24

slide-35
SLIDE 35

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (2/2)

Evaluation of authors and departments h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each)

25

slide-36
SLIDE 36

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (2/2)

Evaluation of authors and departments h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each)

Department a = (a1, a2) author a1: 4 papers each one cited 4 times author a2: 4 papers each one cited 4 times

h-index of both authors is 4 h-index of the department is 4

25

slide-37
SLIDE 37

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (2/2)

Evaluation of authors and departments h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each)

Department a = (a1, a2) author a1: 4 papers each one cited 4 times author a2: 4 papers each one cited 4 times

h-index of both authors is 4 h-index of the department is 4

Department b = (b1, b2) author b1: 3 papers each one cited 6 times author b2: 3 papers each one cited 6 times

h-index of both authors is 3 h-index of the department is 6

25

slide-38
SLIDE 38

Bibliometrics Problems with bibliometric indices

Potential problems with the h-index (2/2)

Evaluation of authors and departments h-index

the h-index of an author is x if this author has x papers having at least x citations each (and her other papers have at most x citations each)

Department a = (a1, a2) author a1: 4 papers each one cited 4 times author a2: 4 papers each one cited 4 times

h-index of both authors is 4 h-index of the department is 4

Department b = (b1, b2) author b1: 3 papers each one cited 6 times author b2: 3 papers each one cited 6 times

h-index of both authors is 3 h-index of the department is 6

Conclusion the “best” department contains the “worst” authors!

25

slide-39
SLIDE 39

Bart summarizes

slide-40
SLIDE 40

Outline

1 Bibliometrics 2 Model & Results 3 Discussion

slide-41
SLIDE 41

Model & Results Authors

Model of Authors

Authors an author is a function f from N to N f(x) is the number of papers by this author having received x citations

28

slide-42
SLIDE 42

Model & Results Authors

Model of Authors

Authors an author is a function f from N to N f(x) is the number of papers by this author having received x citations Set of all Authors A is the set of all functions f from N to N such that

  • x∈N

f(x) is finite

28

slide-43
SLIDE 43

Model & Results Authors

Model of Authors

Authors an author is a function f from N to N f(x) is the number of papers by this author having received x citations Set of all Authors A is the set of all functions f from N to N such that

  • x∈N

f(x) is finite Objective build a binary relation on A f g is “given their publication/citation record, scientists f is at least as good as scientist g”

28

slide-44
SLIDE 44

Model & Results Authors

Model of Authors

Authors an author is a function f from N to N f(x) is the number of papers by this author having received x citations Set of all Authors A is the set of all functions f from N to N such that

  • x∈N

f(x) is finite Objective build a binary relation on A f g is “given their publication/citation record, scientists f is at least as good as scientist g” Important Limitation coauthors are ignored in this talk

28

slide-45
SLIDE 45

Model & Results Authors

Notation and remarks

Notation 0 is an author without any paper 1x is an author with 1 paper having received x citations

29

slide-46
SLIDE 46

Model & Results Authors

Notation and remarks

Notation 0 is an author without any paper 1x is an author with 1 paper having received x citations Remarks Authors are modelled as functions it makes sense to add two authors f and g: f + g it makes sense to multiply an author f by an integer n: n · f

29

slide-47
SLIDE 47

Model & Results Departments

Model of Departments

Departments a department of size k is an element of A k: (f1, f2, . . . , fk)

30

slide-48
SLIDE 48

Model & Results Departments

Model of Departments

Departments a department of size k is an element of A k: (f1, f2, . . . , fk) Set of all Departments D =

  • k∈N

A k

30

slide-49
SLIDE 49

Model & Results Departments

Model of Departments

Departments a department of size k is an element of A k: (f1, f2, . . . , fk) Set of all Departments D =

  • k∈N

A k Objective build a binary relation on D A B is “given their publication/citation record of the scientists in departments A and B, department A is at least as good as department B”

30

slide-50
SLIDE 50

Model & Results Departments

Model of Departments

Departments a department of size k is an element of A k: (f1, f2, . . . , fk) Set of all Departments D =

  • k∈N

A k Objective build a binary relation on D A B is “given their publication/citation record of the scientists in departments A and B, department A is at least as good as department B” Limitations multiple affiliations are ignored field normalization is ignored

30

slide-51
SLIDE 51

Model & Results Axioms

Axioms

Consistency Let A = (a1, a2, . . . , ak) and B = (b1, b2, . . . , bk) be two departments of size k. If ai bi, for all i ∈ {1, 2, . . . , k} then A B Furthermore if ai ≻ bi, for some i ∈ {1, 2, . . . , k} then A ⊲ B

31

slide-52
SLIDE 52

Model & Results Axioms

Axioms

Consistency Let A = (a1, a2, . . . , ak) and B = (b1, b2, . . . , bk) be two departments of size k. If ai bi, for all i ∈ {1, 2, . . . , k} then A B Furthermore if ai ≻ bi, for some i ∈ {1, 2, . . . , k} then A ⊲ B Independence For all f, g ∈ A and all x ∈ N f g ⇔ f + 1x g + 1x

31

slide-53
SLIDE 53

Model & Results Axioms

Axioms

Consistency Let A = (a1, a2, . . . , ak) and B = (b1, b2, . . . , bk) be two departments of size k. If ai bi, for all i ∈ {1, 2, . . . , k} then A B Furthermore if ai ≻ bi, for some i ∈ {1, 2, . . . , k} then A ⊲ B Independence For all f, g ∈ A and all x ∈ N f g ⇔ f + 1x g + 1x Transfer For all A = (a1, a2, . . . , ak) ∈ D, all i, j ∈ {1, 2, . . . , k} and all x ∈ N (a1, . . . , ai + 1x, . . . , ak) (a1, . . . , aj + 1x, . . . , ak)

31

slide-54
SLIDE 54

Model & Results Axioms

Interpretation and Results

Interpretation Consistency appears uncontroversial Independence appears uncontroversial Transfer is strong (but used quite often)

“Inequalities” within departments are ignored

32

slide-55
SLIDE 55

Model & Results Axioms

Interpretation and Results

Interpretation Consistency appears uncontroversial Independence appears uncontroversial Transfer is strong (but used quite often)

“Inequalities” within departments are ignored

Proposition 1 If and are linked by Consistency and if satisfies Transfer then satisfies Independence

32

slide-56
SLIDE 56

Model & Results Axioms

Interpretation and Results

Interpretation Consistency appears uncontroversial Independence appears uncontroversial Transfer is strong (but used quite often)

“Inequalities” within departments are ignored

Proposition 1 If and are linked by Consistency and if satisfies Transfer then satisfies Independence Corollary If is the ranking of authors based on the h-index then there is no such that Transfer and Consistency hold

32

slide-57
SLIDE 57

Model & Results Scoring rules

Scoring rules for scientists

Definition 1 is a scoring rule for scientists (s-scoring rule) if there is a real valued function u on N such that f g ⇔

  • x∈N

f(x)u(x) ≥

  • x∈N

g(x)u(x) u(x) gives the worth of one publication with x citations many bibliometric indices are scoring rules (but not the h-index) all scoring rules satisfy independence

33

slide-58
SLIDE 58

Model & Results Scoring rules

Scoring rules for scientists

Definition 1 is a scoring rule for scientists (s-scoring rule) if there is a real valued function u on N such that f g ⇔

  • x∈N

f(x)u(x) ≥

  • x∈N

g(x)u(x) u(x) gives the worth of one publication with x citations many bibliometric indices are scoring rules (but not the h-index) all scoring rules satisfy independence Examples u(x) = x: number of citations u(x) = 1: number of publications u(x) = 1 if x ≥ α: number of highly cited publications

33

slide-59
SLIDE 59

Model & Results Scoring rules

Rules for departments

Definition 2 is a scoring rule for departments (d-scoring rule) if there is a real valued function v on N such that (a1, a2, . . . , ak) (b1, b2, . . . , bℓ) ⇔

k

  • i=1
  • x∈N

ai(x)v(x) ≥

  • i=1
  • x∈N

bi(x)v(x)

34

slide-60
SLIDE 60

Model & Results Scoring rules

Rules for departments

Definition 2 is a scoring rule for departments (d-scoring rule) if there is a real valued function v on N such that (a1, a2, . . . , ak) (b1, b2, . . . , bℓ) ⇔

k

  • i=1
  • x∈N

ai(x)v(x) ≥

  • i=1
  • x∈N

bi(x)v(x) Definition 3 is an averaging rule for departments (d-averaging rule) if there is a real valued function v on N such that (a1, a2, . . . , ak) (b1, b2, . . . , bℓ) ⇔ 1 k

k

  • i=1
  • x∈N

ai(x)v(x) ≥ 1 ℓ

  • i=1
  • x∈N

bi(x)v(x)

34

slide-61
SLIDE 61

Model & Results Scoring rules

Axioms

Archimedeanness For all f, g, f ′, g′ ∈ A such that f ≻ g there is n ∈ N such that f ′ + (n · f) g′ + (n · g)

35

slide-62
SLIDE 62

Model & Results Scoring rules

Axioms

Archimedeanness For all f, g, f ′, g′ ∈ A such that f ≻ g there is n ∈ N such that f ′ + (n · f) g′ + (n · g) Dummy Scientist For all k ∈ N and all (a1, a2, . . . , ak) ∈ D (a1, a2, . . . , ak) (a1, a2, . . . , ak, 0)

35

slide-63
SLIDE 63

Model & Results Scoring rules

Axioms

Archimedeanness For all f, g, f ′, g′ ∈ A such that f ≻ g there is n ∈ N such that f ′ + (n · f) g′ + (n · g) Dummy Scientist For all k ∈ N and all (a1, a2, . . . , ak) ∈ D (a1, a2, . . . , ak) (a1, a2, . . . , ak, 0) Homogeneity For all k, n ∈ N and all (a1, a2, . . . , ak) ∈ D (a1, a2, . . . , ak) (a1, a1, . . . , a1

  • n

, a2, a2, . . . , a2

  • n

, . . . , ak, ak, . . . , ak

  • n

)

35

slide-64
SLIDE 64

Model & Results Scoring rules

Remarks

all s-scoring rules satisfy Archimedeanness Dummy Scientist is satisfied by d-scoring rules but not by d-averaging rules Homogeneity is satisfied by d-averaging rules but not by d-scoring rules

36

slide-65
SLIDE 65

Model & Results Results

Some results

Theorem 1 (B & Marchant, 2011) The relations and are linked by Consistency, satisfies Transfer and Dummy Scientist, satisfies Archimedeanness if and only if is an s-scoring rule and is a d-scoring rule with u = v The function u is unique up to the multiplication by a positive constant

37

slide-66
SLIDE 66

Model & Results Results

Some results

Theorem 1 (B & Marchant, 2011) The relations and are linked by Consistency, satisfies Transfer and Dummy Scientist, satisfies Archimedeanness if and only if is an s-scoring rule and is a d-scoring rule with u = v The function u is unique up to the multiplication by a positive constant Theorem 2 (B & Marchant, 2011) The relations and are linked by Consistency, satisfies Transfer and Homogeneity, satisfies Archimedeanness if and only if is an s-scoring rule and is a d-averaging rule with u = v The function u is unique up to the multiplication by a positive constant

37

slide-67
SLIDE 67

Model & Results Results

Extensions

Extensions add additional conditions to restrict the shape of u

u is nondecreasing u is constant u is linear

characterize indices instead of rankings Easy!

38

slide-68
SLIDE 68

Outline

1 Bibliometrics 2 Model & Results 3 Discussion

slide-69
SLIDE 69

Discussion

Discussion of results

Axioms Consistency is highly desirable Independence is highly desirable (but violated by the h-index) Archimedeanness is technical Transfer is more debatable (anonymity & inequality)

40

slide-70
SLIDE 70

Discussion

Discussion of results

Axioms Consistency is highly desirable Independence is highly desirable (but violated by the h-index) Archimedeanness is technical Transfer is more debatable (anonymity & inequality) Extensions coauthors multiple affiliations field normalization

40

slide-71
SLIDE 71

Discussion

Discussion of results

Axioms Consistency is highly desirable Independence is highly desirable (but violated by the h-index) Archimedeanness is technical Transfer is more debatable (anonymity & inequality) Extensions coauthors multiple affiliations field normalization Warning beware of institutions using the h-index!

40

slide-72
SLIDE 72

Discussion

Messages

Bibliometrics bibliometrics is not limited to evaluative bibliometrics evaluative bibliometrics is an interesting field of study many wrong beliefs are floating around

41

slide-73
SLIDE 73

Discussion

Messages

Bibliometrics bibliometrics is not limited to evaluative bibliometrics evaluative bibliometrics is an interesting field of study many wrong beliefs are floating around Evaluative bibliometrics in practice it should be used with much care it should not be in the hands of laypersons it should not be entrenched in formal rules it can be useful if used together with careful and impartial peer review

41

slide-74
SLIDE 74

Discussion

Messages

Bibliometrics bibliometrics is not limited to evaluative bibliometrics evaluative bibliometrics is an interesting field of study many wrong beliefs are floating around Evaluative bibliometrics in practice it should be used with much care it should not be in the hands of laypersons it should not be entrenched in formal rules it can be useful if used together with careful and impartial peer review Excellence: IDEX, LABEX, PES excellence is another word for outliers

not everyone can be excellent! what should we do with people that are not excellent? is the mantra of excellence a good motivating tool?

41

slide-75
SLIDE 75

References

Adler, R., Ewing, J., Taylor, P. (2009) Citation statistics Statistical Science, 24 (1), 1–14 Billaut, J.-C., Bouyssou, D., Vincke, Ph. (2011) Should you believe in the Shanghai ranking? An MCDM view Scientometrics, 84 (1), 237–263 Bouyssou, D., Marchant, T. (2011) Ranking scientists and departments in a consistent manner Journal of the American Society for Information Science and Technology, 62 (9), 1761–1769 Bouyssou, D., Marchant, T. (2013) New characterizations of the h-index Working Paper, LAMSADE, 54 pages Marchant, T. (2009) Score-Based Bibliometric Rankings of Authors Journal of the American Society for Information Science and Technology, 60 (6), 1132–1137

slide-76
SLIDE 76
slide-77
SLIDE 77

Bart summarizes

slide-78
SLIDE 78