cern-search-rest-api issueshttps://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues2021-07-14T15:01:18+02:00https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/20Refactor Indico mappings2021-07-14T15:01:18+02:00Pablo Panero VazquezRefactor Indico mappings- [x] Update for ES6 as done for the rest in https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/issues/18
- [x] Migrate to comply with the new invenio-search prefix as done for the rest in https://gitlab.cern.ch/webservi...- [x] Update for ES6 as done for the rest in https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/issues/18
- [x] Migrate to comply with the new invenio-search prefix as done for the rest in https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/issues/24https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/46permissions: lowercase egroups for matching2019-09-24T09:39:33+02:00Pablo Panero Vazquezpermissions: lowercase egroups for matchingIn some cases egroups might come without the same capitalization. Therefore all has to be lowered or uppered.
- [ ] Elasticsearch analyzer
- [ ] ``permissions.py``for direct GET over records.In some cases egroups might come without the same capitalization. Therefore all has to be lowered or uppered.
- [ ] Elasticsearch analyzer
- [ ] ``permissions.py``for direct GET over records.https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/49crawl JaCOW proceedings2019-11-05T10:30:06+01:00Pablo Panero Vazquezcrawl JaCOW proceedingsAll content is in the DFS site: accelconf.web.cern.ch
The StartAddress is http://accelconf.web.cern.ch/AccelConf/JACoW/proceedingsForSearch.php
The standard queries are:
domain=accelconf.web.cern.ch +author:"Wei Hou Tan" url:accelcon...All content is in the DFS site: accelconf.web.cern.ch
The StartAddress is http://accelconf.web.cern.ch/AccelConf/JACoW/proceedingsForSearch.php
The standard queries are:
domain=accelconf.web.cern.ch +author:"Wei Hou Tan" url:accelconf/ipac2019 FileExtension=pdf -url:abstract -url:accelconf/jacowhttps://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/55edms: tune up elasticsearch (performance issues)2021-07-14T15:13:11+02:00Pablo Panero Vazquezedms: tune up elasticsearch (performance issues)When sending more than one big request (~60MB) ES is sometimes timing out. At the moment the timeout has been set to 30 seconds. Need to review the sharding.When sending more than one big request (~60MB) ES is sometimes timing out. At the moment the timeout has been set to 30 seconds. Need to review the sharding.https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/60permissions: evaluate the need for egroup flag2021-07-14T15:14:16+02:00Pablo Panero Vazquezpermissions: evaluate the need for egroup flagThe use of egroups is decided from a config variable. It might not be needed. This would avoid complexity with EDMS.The use of egroups is decided from a config variable. It might not be needed. This would avoid complexity with EDMS.https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/64edms: issues with bulk indexing2019-09-13T16:11:38+02:00Pablo Panero Vazquezedms: issues with bulk indexinguWSGI get slow in bulk indexing, then nginx closes the connection. Generating an error mentioned [here](https://github.com/unbit/uwsgi/issues/1623)
Maybe related with [this](https://github.com/cernopendata/opendata.cern.ch/pull/2428/com...uWSGI get slow in bulk indexing, then nginx closes the connection. Generating an error mentioned [here](https://github.com/unbit/uwsgi/issues/1623)
Maybe related with [this](https://github.com/cernopendata/opendata.cern.ch/pull/2428/commits) issue of CERN OpenDatahttps://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/65docs [internal]: add pipeline updates to datamodel update procedure2019-09-13T16:11:30+02:00Pablo Panero Vazquezdocs [internal]: add pipeline updates to datamodel update procedurehttps://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/66logging: migrate to new invenio-logging2021-07-14T15:17:33+02:00Pablo Panero Vazquezlogging: migrate to new invenio-loggingRaven gives way to many errors. In newer versions of invenio-logging Raven has been deprecated. Need to upgrade to newer versions (Invenio 3.2 full migration?)
Some info in the changelog [here](https://inveniosoftware.org/blog/2019-08-0...Raven gives way to many errors. In newer versions of invenio-logging Raven has been deprecated. Need to upgrade to newer versions (Invenio 3.2 full migration?)
Some info in the changelog [here](https://inveniosoftware.org/blog/2019-08-07-sprint-update/)https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/67edms: query relevance debugging2021-07-14T15:19:43+02:00Pablo Panero Vazquezedms: query relevance debuggingSome queries do not produce the expected results:
**EDMS dev**
``` console
?access=en-dep-ace-plm@cern.ch&q=NOT-0010+AND+_data.object_type:D+AND+_data.obsolete:false&size=100
```
Returns the firsts results just with ``NOT`` in the tit...Some queries do not produce the expected results:
**EDMS dev**
``` console
?access=en-dep-ace-plm@cern.ch&q=NOT-0010+AND+_data.object_type:D+AND+_data.obsolete:false&size=100
```
Returns the firsts results just with ``NOT`` in the title (e.g. 59355). However, records such as 142684 containing ``NOT-0010`` appear much later in the results (having to set up size=100 to see it).
``` console
?access=en-dep-ace-plm@cern.ch&q=(_data.author.full_name:(nikitas.kotsolakos@cern.ch)+OR+_data.author.email:(nikitas.kotsolakos@cern.ch))+AND+_data.object_type:D+AND+_data.obsolete:false&page=1&size=100
```
First returns results with no match, then the revers ``kotsolakos.nikitas@cern.ch`` and only later on some with the queried email.https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/76docs: instance configuration2019-10-11T11:42:04+02:00Pablo Panero Vazquezdocs: instance configurationDocument the instance configuration (env vars) as part of the deployment process.Document the instance configuration (env vars) as part of the deployment process.https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/90Logging: Debug logs broken inside docker / oc2019-11-21T11:24:03+01:00Carina AntunesLogging: Debug logs broken inside docker / ocFlags don't seem to work. Investigate and fix.Flags don't seem to work. Investigate and fix.https://gitlab.cern.ch/webservices/cern-search/cern-search-rest-api/-/issues/109Python tutorial2022-03-17T16:06:12+01:00Carina AntunesPython tutorialPython
- [ ] Depending on level follow [Python tutorial](https://www.python-course.eu/python3_course.php) and/or [OOP](https://www.python-course.eu/object_oriented_programming_introduction.php) and/or [Advanced](https://www.python-cours...Python
- [ ] Depending on level follow [Python tutorial](https://www.python-course.eu/python3_course.php) and/or [OOP](https://www.python-course.eu/object_oriented_programming_introduction.php) and/or [Advanced](https://www.python-course.eu/advanced_python.php)
From Python Tutorial at least
- https://www.python-course.eu/python3_passing_arguments.php
From OOP at least
- Properties vs. Getters and Setters
- Implementing a Property Decorator
- Multiple Inheritance
- Magic Methods and Operator Overloading
From Advanced at least:
- Iterators and Generators
- Lambda Operator, Filter, Reduce and Map
- zip Tutorial
- Decorators
- Memoization with Decorators
- List Comprehension
Tests
- [ ] https://docs.pytest.org/en/latest/how-to/fixtures.html
- [ ] https://docs.pytest.org/en/latest/how-to/parametrize.html
- [ ] https://docs.python.org/3/library/unittest.mock-examples.htmlNoor Afshan FathimaNoor Afshan Fathima