Full-text search for Jupyter notebooks
-
-
-
https://fscrawler.readthedocs.io/en/fscrawler-2.9/
Welcome to the FS Crawler for Elasticsearch. This crawler helps to index binary documents such as PDF, Open Office, and MS Office files.
Main features:
Crawls a local file system (or a mounted drive) to index new files, update existing ones, and remove old ones.
Crawls remote file systems over SSH/FTP.
REST interface to let you “upload” your binary documents to Elasticsearch.
-
-
-
https://subscription.packtpub.com/book/big+data+and+business+intelligence/9781784394936/1/ch01lvl1sec10/importing-a-json-file-into-elasticsearch
Importing a JSON file into Elasticsearch
Notebooks are themselves JSON; could they be loaded directly with this command?
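One thing to check before loading a notebook as-is: an .ipynb file is a single JSON object whose text sits inside a `cells` array, with each cell's `source` stored either as a string or as a list of line strings. A rough sketch of flattening a notebook into one small document per cell, so that each cell becomes searchable on its own, might look like the following. The function name `notebook_to_docs` and the output field names (`notebook`, `cell_index`, `cell_type`, `text`) are made up for illustration and are not taken from any of the tools linked here.

```python
import json


def notebook_to_docs(notebook, notebook_name):
    """Turn a parsed .ipynb dict into a list of per-cell documents.

    The output field names are hypothetical; pick whatever suits the index mapping.
    """
    docs = []
    for position, cell in enumerate(notebook.get("cells", [])):
        source = cell.get("source", "")
        # The notebook format allows "source" to be a string or a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        docs.append(
            {
                "notebook": notebook_name,
                "cell_index": position,
                "cell_type": cell.get("cell_type"),
                "text": source,
            }
        )
    return docs


# A tiny in-memory notebook standing in for json.load(open("some.ipynb")).
sample_notebook = {
    "nbformat": 4,
    "nbformat_minor": 5,
    "metadata": {},
    "cells": [
        {"cell_type": "markdown", "metadata": {}, "source": ["# Title\n", "Some notes"]},
        {
            "cell_type": "code",
            "metadata": {},
            "execution_count": None,
            "outputs": [],
            "source": "import os\nprint(os.getcwd())",
        },
    ],
}

docs = notebook_to_docs(sample_notebook, "demo.ipynb")
for doc in docs:
    print(json.dumps(doc))
```

Whether per-cell or per-notebook documents work better depends on how the search results should be presented; this sketch only shows the per-cell option.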
-
-
https://github.com/genostack/elasticsearch_loader
Main features
Batch upload CSV (actually any *SV) files to Elasticsearch
Batch upload JSON files / JSON lines to Elasticsearch
Batch upload parquet files to Elasticsearch
Predefining custom mappings
Delete index before upload
Index documents with _id from the document itself
Load data directly from url
SSL and basic auth
Unicode support
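Since JSON lines is one of the listed input formats, notebook content could be written out as one JSON object per line before batch upload. The sketch below assumes documents shaped like `{"notebook": ..., "text": ...}`; those field names and the helper `to_json_lines` are hypothetical, and the exact loader invocation is left out because it should be checked against the project's README.

```python
import json


def to_json_lines(docs):
    """Serialize an iterable of dicts as JSON lines (one object per line).

    ensure_ascii=False keeps non-ASCII text (e.g. Chinese notes) readable in the file.
    """
    return "\n".join(json.dumps(doc, ensure_ascii=False) for doc in docs) + "\n"


# Hypothetical documents; in practice these would come from parsed notebooks.
example_docs = [
    {"notebook": "demo.ipynb", "cell_index": 0, "text": "全文搜索 notes"},
    {"notebook": "demo.ipynb", "cell_index": 1, "text": "import os"},
]

payload = to_json_lines(example_docs)
print(payload, end="")

# Writing to disk would then be a plain text write with UTF-8 encoding:
# with open("cells.jsonl", "w", encoding="utf-8") as handle:
#     handle.write(payload)
```

Each line parses independently, which is what makes the format convenient for batch tools.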
