EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #10326


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

Re: [EP-tech] Query for IRSTATS2 Download from Access Table


CAUTION: This e-mail originated outside the University of Southampton.
Yup, include the entity ID also. 

Thanks John, i need to report the figure for my institution data. 

I'm afraid the data insend will be different with the one after the process complete.

I guess it will be different ... LoL

Btw, is there I can do to make the process faster? The resources (Memory, CPU) like normal.

Izwan
UiTM Digital Library

On Fri, 23 Jan 2026, 4:26 pm John Salter, <J.Salter@leeds.ac.uk> wrote:
CAUTION: This e-mail originated outside the University of Southampton.
CAUTION: This e-mail originated outside the University of Southampton.

Hi Izwan,

The filters are defined in the plugins/EPrints/Plugin/Stats/Filter/ - files. In this case Robots.pm and Repeat.pm are the ones you want.

 

Are the ‘Access Table’ figures from your query with the referring_entity_id !='' part in it?

The IRStats2 figures should always be less than the raw access data. Modern browsers may not send a referrer in some cases. Comparing the raw access table counts with the IRStats2 data might be a little bit more consistent – although some days/months your repository will be visited by lots of different robots – other times less.

 

Cheers,

John

 

 

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> On Behalf Of MOHD.IZWAN SALIM
Sent: 23 January 2026 05:10
To: eprints-tech@ecs.soton.ac.uk
Subject: Re: [EP-tech] Query for IRSTATS2 Download from Access Table

 

CAUTION: External Message. Use caution opening links and attachments.

CAUTION: This e-mail originated outside the University of Southampton.

CAUTION: This e-mail originated outside the University of Southampton.

Dear John, where can I see the filtering for bot, human double click those?

 

The difference is so random

 

 

Access Table

Eprints IRStat2

Different

Jan

             216,293

              279,265

   (62,972)

Feb

             182,542

              196,933

   (14,391)

Mac

             175,017

              198,564

   (23,547)

Apr

             239,973

              250,356

   (10,383)

Mei

             263,108

              227,056

     36,052

Jun

             251,904

              209,344

     42,560

Jul

             263,250

              227,984

     35,266

Ogos

             201,476

              212,356

   (10,880)

Sept

             332,020

              168,318

   163,702

Otk

               44,751

                49,454

     (4,703)

 

Izwan

UiTM Digital Library

 

 

On Fri, Jan 23, 2026 at 4:54AM John Salter <J.Salter@leeds.ac.uk> wrote:

CAUTION: This e-mail originated outside the University of Southampton.

CAUTION: This e-mail originated outside the University of Southampton.

Hi Izwan,
The way IRStats2 processes requests is more complicated than you can easily  replicate in a normal SQL query.

It tries to count ‘human’ usage of content, so has a list of ‘robot’ user-agents (https://www.eprints.org/resource/bad_robots/robots_ua.txt ) which it filters against - as well as other approaches.

If you need to provide some 2025 data for someone, you could select data from the access table – but say that it includes all records – robots, humans, double-clicks etc., so will over-count the figures significantly.

 

Cheers,

John

 

John Salter

https://orcid.org/0000-0002-8611-8266

 

White Rose Libraries Technical Officer
Library and Research Management team, IT
University of Leeds

 

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> On Behalf Of MOHD.IZWAN SALIM
Sent: 22 January 2026 05:05
To: Eprints Tech <eprints-tech@ecs.soton.ac.uk>
Subject: [EP-tech] Query for IRSTATS2 Download from Access Table

 

CAUTION: External Message. Use caution opening links and attachments.

CAUTION: This e-mail originated outside the University of Southampton.

CAUTION: This e-mail originated outside the University of Southampton.

Dear Eprints

 

Our repo is currently under repocessing the data. And it takes a very long time to reach 2025.

I plan to do a quick query directly from the access table. I can't get the same or near the same as shown in the IRStat screen (for previous year)

 

Example for 2013
Repo Stat Screen shows -186,757

 

Query from the access table  - 282,322

Query with condition referring_entity_id !='' - 165,815

 

My query is 

SELECT COUNT(*) AS Jumlah

FROM access a LEFT JOIN eprint b ON a.referent_id=b.eprintid
WHERE a.datestamp_year='2013' AND service_type_id='?fulltext=yes' AND referring_entity_id !='' AND eprint_status='archive'

 

Is there any other condition I should add to at least be near the one shown in the EPrint page?

 

Izwan
UiTM Digital Library

 

PENAFIAN: E-mel ini dan apa-apa fail yang dihantar bersama-samanya ("Mesej") adalah dihasratkan hanya untuk kegunaan penerima yang dinyatakan di atas dan mungkin mengandungi maklumat yang tidak umum, bermilik, istimewa, sulit dan dikecualikan dari penzahiran di bawah undang-undang yang terpakai termasuklah Akta Rahsia Rasmi 1972. BACA SELANJUTNYA...


DISCLAIMER : This e-mail and any files transmitted with it ("Message") is intended only for the use of the recipient(s) named above and may contain information that is non-public,  proprietary,  privileged,  confidential  and  exempt  from  disclosure under applicable law including the Official Secrets Act 1972. READ MORE...

*** Options: https://wiki.eprints.org/w/Eprints-tech_Mailing_List
*** Archive: https://www.eprints.org/tech.php/
*** EPrints community wiki: https://wiki.eprints.org/

 

PENAFIAN: E-mel ini dan apa-apa fail yang dihantar bersama-samanya ("Mesej") adalah dihasratkan hanya untuk kegunaan penerima yang dinyatakan di atas dan mungkin mengandungi maklumat yang tidak umum, bermilik, istimewa, sulit dan dikecualikan dari penzahiran di bawah undang-undang yang terpakai termasuklah Akta Rahsia Rasmi 1972. BACA SELANJUTNYA...


DISCLAIMER : This e-mail and any files transmitted with it ("Message") is intended only for the use of the recipient(s) named above and may contain information that is non-public,  proprietary,  privileged,  confidential  and  exempt  from  disclosure under applicable law including the Official Secrets Act 1972. READ MORE...

*** Options: https://wiki.eprints.org/w/Eprints-tech_Mailing_List
*** Archive: https://www.eprints.org/tech.php/
*** EPrints community wiki: https://wiki.eprints.org/


PENAFIAN: E-mel ini dan apa-apa fail yang dihantar bersama-samanya ("Mesej") adalah dihasratkan hanya untuk kegunaan penerima yang dinyatakan di atas dan mungkin mengandungi maklumat yang tidak umum, bermilik, istimewa, sulit dan dikecualikan dari penzahiran di bawah undang-undang yang terpakai termasuklah Akta Rahsia Rasmi 1972. BACA SELANJUTNYA...


DISCLAIMER : This e-mail and any files transmitted with it ("Message") is intended only for the use of the recipient(s) named above and may contain information that is non-public,  proprietary,  privileged,  confidential  and  exempt  from  disclosure under applicable law including the Official Secrets Act 1972. READ MORE...