csv 파일의 빈 셀을 0으로 채우고 헤더 없이 셀을 제거하는 방법은 무엇입니까?

2024-5-29 • tag-icon

csv 파일의 빈 셀을 0으로 채우고 헤더 없이 셀을 제거하는 방법은 무엇입니까?

빈 셀과 헤더가 없는 셀이 포함된 Cassandra 데이터베이스로 정말 역겨운 csv 파일을 가져오고 싶습니다. 나는 노력했다

#!/bin/bash

for i in $( seq 1 2); do
  sed -e "s/^,/$2,/" -e "s/,,/,$2,/g" -e "s/,$/,$2/" -i $1
done

그런데 적용해 보니 별 효과가 없는 것 같아요

bash fill-empty-values.sh bouffe500.csv 0

나는 chmod +x그렇게하지 않았습니다.

다음은 스크립트를 적용한 후의 bouffe500.csv 이미지입니다. 여전히 빈 필드가 있습니다.

Cassandra로 csv 파일을 가져오려고 할 때 빈 필드가 문제를 일으킬 것이라고 생각했고 다음과 같은 답변을 받았습니다.

cqlsh:k1> COPY k1.bouffe FROM 'bouffe_v2.csv' WITH HEADER=true;
Using 3 child processes

Starting copy of k1.bouffe with columns [code, additives, additives_fr, ...
...
Failed to import 1 rows: ParseError - Invalid row length 112 should be 163,  given up without retries
Failed to import 7 rows: ParseError - Invalid row length 60 should be 163,  given up without retries
Failed to import 1 rows: ParseError - Invalid row length 74 should be 163,  given up without retries
Failed to import 2 rows: ParseError - Invalid row length 32 should be 163,  given up without retries
Failed to import 5 rows: ParseError - Invalid row length 31 should be 163,  given up without retries
Failed to import 1 rows: ParseError - Invalid row length 85 should be 163,  given up without retries
Failed to import 1 rows: ParseError - Invalid row length 111 should be 163,  given up without retries
Failed to import 3 rows: ParseError - Invalid row length 81 should be 163,  given up without retries
Failed to process 499 rows; failed rows written to import_k1_bouffe.err
Processed: 499 rows; Rate:     801 rows/s; Avg. rate:    1211 rows/s
499 rows imported from 1 files in 0.412 seconds (0 skipped).
cqlsh:k1> SELECT * from bouffe;

관련 정보