Citation dynamics

From HandWiki
Short description: Number of references received by a scientific work over time
Yearly citation c(t) for a research paper from the e Physical Review corpus.

Citation dynamics describes the number of references received by the article or other scientific work over time. The citation dynamics is usually described by the bang, that take place two–three years after the work has been published, and the burst size spans several orders of magnitude. The presence of bursts is not consistent with other models based on preferential attachment. Those models are able to account for the skewed citation distribution but their reference accumulation is gradual.[1]

The dynamics of scientific production has changed significantly over the past years. Due to technological progress, the number of published papers has been increasing exponentially until now. This, along with a much shorter time needed for the article to be published, has affected the citation dynamics of the modern papers. Furthermore, if the reference list of the study includes papers published in different years, older papers tend to have more citations. This may not necessarily because they are better but just because they had more time to accumulate those references.

Model

It has been found that citation distributions are best described by a shifted power-law.[1] The probability that paper [math]\displaystyle{ i }[/math] is cited at time [math]\displaystyle{ t }[/math] after publication as:

[math]\displaystyle{ \Pi_{i}(t) = \eta_{i}c_i^tP_{i}(t) }[/math]

where [math]\displaystyle{ \Pi_{i}(t) }[/math] serves as the outcome variable for each particular paper [math]\displaystyle{ i }[/math] at time [math]\displaystyle{ t }[/math]. Fitness, [math]\displaystyle{ \eta_i }[/math], captures the inherent differences between papers, accounting for the perceived novelty and importance of a discovery. [math]\displaystyle{ c_i^t }[/math] represents the cumulative number of citations acquired by a paper [math]\displaystyle{ i }[/math] at time [math]\displaystyle{ t }[/math] and [math]\displaystyle{ P_{i}(t) }[/math] is a log-normal survival probability.[2] The probability is equal

[math]\displaystyle{ P_{i}(t) = \frac{1}{\sqrt{2\pi}\sigma_it} exp \left \lbrack -\frac{(ln t - \mu_i)^2}{2\sigma_i^2} \right \rbrack }[/math]

where [math]\displaystyle{ t }[/math] is time; [math]\displaystyle{ \sigma }[/math] is longevity, capturing the decay rate; and [math]\displaystyle{ m }[/math] indicates immediacy, governing the time for a paper to reach its citation peak. The ultimate impact [math]\displaystyle{ c_i^{\infty} }[/math] represents the total number or citations that the paper receives during its lifetime.

[math]\displaystyle{ c_i^{\infty} = m(e^{\lambda_i} - 1) }[/math]

Where [math]\displaystyle{ m }[/math] is a global parameter that has the same value for all publications. [math]\displaystyle{ \lambda_i }[/math] represents the relative fitness of the paper. From the above formula, we can see that the total number of references that the paper can receive during its lifetime depends only on its relative fitness which is very hard to quantify.

See also

References

  1. 1.0 1.1 Eom, Young-Ho; Fortunato, Santo (2011). "Characterizing and Modeling Citation Dynamics". PLOS ONE 6 (9): e24926. doi:10.1371/journal.pone.0024926. PMID 21966387. Bibcode2011PLoSO...624926E. 
  2. Wang, Dashun; Song, Chaoming; Barabási, Albert-László (4 Oct 2013). "Quantifying Long-Term Scientific Impact". doi:10.1126/science.1237825. https://www.science.org/doi/pdf/10.1126/science.1237825. Retrieved 18 May 2017.