Skip to the content.

Seret Indexer is a Python-based web scraping and search project that fetches movie data from seret.co.il and from edb.co.il and indexes it into Elasticsearch for offline and efficient searching.

It uses Github Actions to run the indexer periodically and save the data to Docker images.

Searching

The results can be found in the docker image ghcr.io/tal-sitton/seret-search:latest.

So all you need to do is run the following command:

docker network create -d bridge elastic-net
docker run -p 9200:9200 --network elastic-net ghcr.io/tal-sitton/seret-search:latest
docker run -p 5601:5601 --network elastic-net docker.elastic.co/kibana/kibana:8.0.0

and you can access the elastic in port 9200, and the kibana in port 5601.

To run it yourself:

Prerequisites

Installation

  1. Clone the repository:
    git clone https://github.com/tal-sitton/seret-indexer
    
  2. Install the required Python packages:
    pip install -r requirements.txt
    

Usage

  1. Start the Elasticsearch (and if you want, the Kibana) services using Docker Compose:
    docker compose -f elastic/docker-compose.yml up -d elasticsearch kibana
    
  2. Run the main Python script:
    python main.py
    

Project Structure