fill down by record in Google Refine

757 Views Asked by At

I have the following comma-delimited CSV file in Google refine:

enter image description here

How do I fill down the values from column1 using Jython or GREL to become:

enter image description here

I have tried:

if value is None:
 return row["record"]["cells"]["column1"]["value"][0]
else:
 return value

Any suggestions? thank you

1

There are 1 best solutions below

0
On

The tricky part of your example is the record id 1 as you have two different values in column1 within the same record.

So to complete this you have to

  1. Create a new index by merging your ID and column1 field (see tutorial)
  2. Use the expression row.record.cells["Column1"].value[0] to fill down (see tutorial)

I've successfully done it in 6 steps here is the JSON code to apply to your project:

    [
  {
    "op": "core/column-addition",
    "description": "Create column ID2 at index 2 based on column Column1 using expression grel:cells['ID'].value+value",
    "engineConfig": {
      "facets": [],
      "mode": "record-based"
    },
    "newColumnName": "ID2",
    "columnInsertIndex": 2,
    "baseColumnName": "Column1",
    "expression": "grel:cells['ID'].value+value",
    "onError": "set-to-blank"
  },
  {
    "op": "core/text-transform",
    "description": "Text transform on cells in column ID2 using expression grel:cells['ID'].value",
    "engineConfig": {
      "facets": [
        {
          "invert": false,
          "expression": "isBlank(value)",
          "selectError": false,
          "omitError": false,
          "selectBlank": false,
          "name": "ID",
          "omitBlank": false,
          "columnName": "ID",
          "type": "list",
          "selection": [
            {
              "v": {
                "v": false,
                "l": "false"
              }
            }
          ]
        },
        {
          "invert": false,
          "expression": "isBlank(value)",
          "selectError": false,
          "omitError": false,
          "selectBlank": false,
          "name": "Column1",
          "omitBlank": false,
          "columnName": "Column1",
          "type": "list",
          "selection": [
            {
              "v": {
                "v": true,
                "l": "true"
              }
            }
          ]
        }
      ],
      "mode": "row-based"
    },
    "columnName": "ID2",
    "expression": "grel:cells['ID'].value",
    "onError": "keep-original",
    "repeat": false,
    "repeatCount": 10
  },
  {
    "op": "core/text-transform",
    "description": "Text transform on cells in column ID2 using expression grel:cells['Column1'].value",
    "engineConfig": {
      "facets": [
        {
          "invert": true,
          "expression": "value",
          "selectError": false,
          "omitError": false,
          "selectBlank": true,
          "name": "Column1",
          "omitBlank": false,
          "columnName": "Column1",
          "type": "list",
          "selection": [
            {
              "v": {
                "v": " ",
                "l": " "
              }
            }
          ]
        },
        {
          "invert": false,
          "expression": "isBlank(value)",
          "selectError": false,
          "omitError": false,
          "selectBlank": false,
          "name": "ID",
          "omitBlank": false,
          "columnName": "ID",
          "type": "list",
          "selection": [
            {
              "v": {
                "v": true,
                "l": "true"
              }
            }
          ]
        }
      ],
      "mode": "row-based"
    },
    "columnName": "ID2",
    "expression": "grel:cells['Column1'].value",
    "onError": "keep-original",
    "repeat": false,
    "repeatCount": 10
  },
  {
    "op": "core/column-move",
    "description": "Move column ID2 to position 0",
    "columnName": "ID2",
    "index": 0
  },
  {
    "op": "core/text-transform",
    "description": "Text transform on cells in column Column1 using expression grel:row.record.cells[\"Column1\"].value[0]",
    "engineConfig": {
      "facets": [],
      "mode": "record-based"
    },
    "columnName": "Column1",
    "expression": "grel:row.record.cells[\"Column1\"].value[0]",
    "onError": "keep-original",
    "repeat": false,
    "repeatCount": 10
  },
  {
    "op": "core/column-removal",
    "description": "Remove column ID2",
    "columnName": "ID2"
  }
]