LABE

Merging citations and catalog data at SLUB Dresden.

Status: testing

Project

The project consists of a few command line tools, written in Python and Go.

  • ckit, a citation toolkit containing an API server plus a few command line tools (Go)
  • python, an orchestration helper to assemble data files regularly (Python, based on luigi)

Meeting Minutes

Project structure

$ tree -d
.
├── ansible
│   └── roles
│       ├── common
│       │   └── tasks
│       └── labe
│           ├── defaults
│           ├── tasks
│           └── templates
├── data
├── extra
│   └── perfstats
├── go
│   └── ckit
│       ├── cache
│       ├── cmd
│       │   ├── doisniffer
│       │   ├── labed
│       │   ├── makta
│       │   └── tabjson
│       ├── doi
│       ├── fixtures
│       ├── packaging
│       │   └── deb
│       │       └── ckit
│       │           └── DEBIAN
│       ├── set
│       ├── static
│       ├── tabutils
│       ├── testdata
│       └── xflag
├── notes
│   └── 2022_01_30_performance_report_files
├── python
│   ├── labe
│   ├── packaging
│   │   └── deb
│   │       └── labe
│   │           └── DEBIAN
│   └── tests
└── static

40 directories

SLOC

$ tokei -C -t=Go,Python,yaml
===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Go                     19         2666         2134          341          191
 Python                 16         2881         2345          126          410
 YAML                    4          249          193           35           21
===============================================================================
 Total                  39         5796         4672          502          622
===============================================================================

Ideas

  • stats on combined oci, refcat graph; notes
  • stats on combined oci, openalex (mag), refcat graph
  • include a "cited by" count in documents; this may need a separate mapping database (with about 70M rows) for (doi, cited by count) -- it could also be a COUNT on oci, but a separate lookup table may be preferable for performance (e.g. result would be just an int; db is about 4GB)
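
The trade-off in the last idea (a COUNT on the citation table versus a precomputed lookup table) can be illustrated with a minimal sketch using Python's built-in sqlite3 module. The table names (oci, cited_by), column names (source, target, doi, n), helper functions and sample DOIs are all hypothetical and chosen for illustration only; they are not taken from the project's actual schema.

```python
import sqlite3

# Hypothetical schema; the project's real OCI table layout may differ.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE oci (source TEXT NOT NULL, target TEXT NOT NULL);
    CREATE INDEX idx_oci_target ON oci (target);
    CREATE TABLE cited_by (doi TEXT PRIMARY KEY, n INTEGER NOT NULL);
    """
)

# Toy citation edges: (citing DOI, cited DOI). Made-up values.
edges = [
    ("10.1/a", "10.1/x"),
    ("10.1/b", "10.1/x"),
    ("10.1/c", "10.1/y"),
]
conn.executemany("INSERT INTO oci (source, target) VALUES (?, ?)", edges)
conn.commit()


def cited_by_count_live(conn, doi):
    """Option 1: count matching rows in the citation table on every request."""
    row = conn.execute(
        "SELECT COUNT(*) FROM oci WHERE target = ?", (doi,)
    ).fetchone()
    return row[0]


def build_cited_by(conn):
    """Option 2, build step: aggregate once into a (doi, n) lookup table."""
    conn.execute("DELETE FROM cited_by")
    conn.execute(
        "INSERT INTO cited_by (doi, n) "
        "SELECT target, COUNT(*) FROM oci GROUP BY target"
    )
    conn.commit()


def cited_by_count_cached(conn, doi):
    """Option 2, lookup step: a single primary-key read returning an int."""
    row = conn.execute(
        "SELECT n FROM cited_by WHERE doi = ?", (doi,)
    ).fetchone()
    return row[0] if row else 0


build_cited_by(conn)
```

Both functions return the same number for a given DOI; the lookup table trades extra storage and a rebuild step after each data update for a constant-size read per request.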

Misc

A data web service, lightning talk on the Go side of things at Leipzig Gophers 2021-11-23