Skip to content
Snippets Groups Projects

Created data loader for sikt.no and changed the configuration to be domain...

Merged Ask Markestad requested to merge 22-create-sikt-no-data-loader into master
1 unresolved thread

Created data loader for sikt.no and changed the configuration to be domain specific and not part of rag package. Keeping the sikt-no data loader here as an example of how it is done just as a temporary step while the package is being figured out.

Closes #22 (closed)

Merge request reports

Pipeline #368306 passed

Pipeline passed for 1ed7a96b on 22-create-sikt-no-data-loader

Merged by Ask MarkestadAsk Markestad 4 months ago (Nov 18, 2024 7:59am UTC)

Loading

Pipeline #370535 passed

Pipeline passed for 365a5654 on master

Deployed to produ‎ction‎ 4 months ago
Deployed to sta‎ging‎ 4 months ago

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
1 import asyncio
2 import random
3 import sys
4 from collections.abc import Callable
5 from typing import Optional
6
7
8 class SiktConfiguration:
  • Maintainer

    Can feide configuration just be an extension of this, and this then just be a default "Configuration"?

  • Unless the plan for Configuration is something entirely different, I really think it should not even be a class. It doesn't need to be. There is some overlap in these cases of where data is gathered from, so I can see a usecase for making some utils functions available through our package for loading. But I don't think it is worth the effort to abstract and generalize this bit. It will only make the code less readable and complicate interaction with it.

    I think long term, how the data is loaded should not be a concern of our package. It will vary massively between implementations, and here I think you want to have as simple an interface as possible, and a list of Document objects as the user interface to our package seems like the best.

  • Maintainer

    Configuration should be something different, this bit is now just a data-loader and has lost most "configuration"-like properties. So perhaps that should be the change here, remove the class and just have the method get_docs here and in feide, and leave Configuration with the rest (environment reading can be pushed in there later)

  • Please register or sign in to reply
  • A.C. approved this merge request

    approved this merge request

  • Ask Markestad mentioned in commit 365a5654

    mentioned in commit 365a5654

  • Please register or sign in to reply
    Loading