
Error: Trying to access flag --preserve_unused_tokens before flags were parsed #1133

Open
rv-ltran opened this issue Aug 7, 2020 · 21 comments

Comments

@rv-ltran

rv-ltran commented Aug 7, 2020

I had been using the following code without problems until this morning, when I got an error from bert.run_classifier.convert_examples_to_features(test_InputExamples, label_list, MAX_SEQ_LENGTH, tokenizer).

Please let me know how to fix it

import pandas as pd
import tensorflow as tf
import tensorflow_hub as hub
import bert
from bert import run_classifier
from bert import optimization
from bert import tokenization
from tensorflow.contrib import predictor
import pkg_resources
pkg_resources.get_distribution("bert-tensorflow").version


input_words = "Hello"

DATA_COLUMN = "message"
LABEL_COLUMN = "category_label"


test = pd.DataFrame({DATA_COLUMN: [input_words], LABEL_COLUMN : [0]})

BERT_MODEL_HUB = "https://tfhub.dev/google/bert_uncased_L-12_H-768_A-12/1"

def create_tokenizer_from_hub_module():
  """Get the vocab file and casing info from the Hub module."""
  with tf.Graph().as_default():
    bert_module = hub.Module(BERT_MODEL_HUB)
    tokenization_info = bert_module(signature="tokenization_info", as_dict=True)
    with tf.Session() as sess:
      vocab_file, do_lower_case = sess.run([tokenization_info["vocab_file"],
                                            tokenization_info["do_lower_case"]])

  return bert.tokenization.FullTokenizer(
      vocab_file=vocab_file, do_lower_case=do_lower_case)

tokenizer = create_tokenizer_from_hub_module()

test_InputExamples = test.apply(lambda x: bert.run_classifier.InputExample(guid=None, 
                                                               text_a = x[DATA_COLUMN], 
                                                               text_b = None, 
                                                               label = x[LABEL_COLUMN]), axis = 1)

# We'll set sequences to be at most 128 tokens long.
MAX_SEQ_LENGTH = 128
label_list = [6,1,2,4,3,5,0]
# Convert our test features to InputFeatures that BERT understands.
test_features = bert.run_classifier.convert_examples_to_features(test_InputExamples, label_list, MAX_SEQ_LENGTH, tokenizer)


Error:

INFO:tensorflow:Writing example 0 of 1
INFO:tensorflow:Writing example 0 of 1
UnparsedFlagAccessError: Trying to access flag --preserve_unused_tokens before flags were parsed.
---------------------------------------------------------------------------
UnparsedFlagAccessError                   Traceback (most recent call last)
<command-35675914> in <module>
     16 label_list = [6,1,2,4,3,5,0]
     17 # Convert our test features to InputFeatures that BERT understands.
---> 18 test_features = bert.run_classifier.convert_examples_to_features(test_InputExamples, label_list, MAX_SEQ_LENGTH, tokenizer)
     19 
     20 input_ids_list = [x.input_ids for x in test_features]

/databricks/python/lib/python3.7/site-packages/bert/run_classifier.py in convert_examples_to_features(examples, label_list, max_seq_length, tokenizer)
    778 
    779     feature = convert_single_example(ex_index, example, label_list,
--> 780                                      max_seq_length, tokenizer)
    781 
    782     features.append(feature)

/databricks/python/lib/python3.7/site-packages/bert/run_classifier.py in convert_single_example(ex_index, example, label_list, max_seq_length, tokenizer)
    394     label_map[label] = i
    395 
--> 396   tokens_a = tokenizer.tokenize(example.text_a)
    397   tokens_b = None
@liuyibox

liuyibox commented Aug 7, 2020

I got the same error, and it just started happening today.

@AMZzee

AMZzee commented Aug 8, 2020

Hey, this is an error caused by a recent version update of bert-tensorflow.
Change pip install bert-tensorflow to pip install bert-tensorflow==1.0.1
This will solve the error by installing the previous version.
You can use the previous version until the developers fix this issue.

@rv-ltran
Author

Do you know when it will be fixed by the developers?

@deeptimittal97

I am getting the same error. Any progress on this?

@GauravSahani1417

Hey, this is an error caused by a recent version update of bert-tensorflow.
Change pip install bert-tensorflow to pip install bert-tensorflow==1.0.1
This will solve the error by installing the previous version.
You can use the previous version until the developers fix this issue.

Downgrading the bert-tensorflow version resolved it!

@jasser94

I got the same error today and downgrading bert-tensorflow to 1.0.1 didn't fix it. Any other ideas?
tf version 1.15.2
python version 3.6.10
Anaconda environment on Ubuntu 18.04

@dpinol

dpinol commented Sep 23, 2020

For me, downgrading bert-tensorflow from 1.0.4 to 1.0.1 solved the issue.
I'm retraining a model from Colab.

@unre4l

unre4l commented Sep 29, 2020

Any news here? Downgrading can't be the final solution 🤔

@ekeleshian

For me, downgrading bert-tensorflow to 1.0.1 and tensorflow to 2.0.0 worked, but with one workaround.
Context: after downgrading the libraries and running my script, this error was thrown: with tf.gfile.GFile(vocab_file, "r") as reader: AttributeError: module 'tensorflow' has no attribute 'gfile'
To fix it, I edited the code in site-packages/bert/tokenization.py where gfile was being called and replaced it with with tf.io.gfile.GFile(vocab_file, "r") as reader.
It's not the cleanest, but it worked for me.
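Rather than editing site-packages by hand, the same workaround can be applied at runtime. A minimal sketch (assuming TF 2.x, where tf.gfile was removed in favor of tf.io.gfile; the helper name here is hypothetical):

```python
def restore_legacy_gfile(tf_module):
    """Alias tf.io.gfile as tf.gfile if the legacy name is missing,
    so older libraries that still call tf.gfile.GFile keep working."""
    if not hasattr(tf_module, "gfile"):
        tf_module.gfile = tf_module.io.gfile
    return tf_module

# Hypothetical usage, before importing bert:
# import tensorflow as tf
# restore_legacy_gfile(tf)
# from bert import tokenization  # now finds tf.gfile.GFile
```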

@DerekGrant

Hey, this is an error caused by a recent version update of bert-tensorflow.
Change pip install bert-tensorflow to pip install bert-tensorflow==1.0.1
This will solve the error by installing the previous version.
You can use the previous version until the developers fix this issue.

Thanks, it helps a lot!

@ZetiMente

Is Google really using BERT in its search engine while not updating this repo to TensorFlow 2.4? I have the error AttributeError: module 'tensorflow_estimator.python.estimator.api._v1.estimator' has no attribute 'TPUEstimator', and a Google search led me here.

@yuvrajeyes

same issue

liuwh0107 added a commit to liuwh0107/NCTU_AI_final_project that referenced this issue Jun 18, 2021
# train - Embedding
# split dataset - train + validation
# BERT
# for unknown reasons, training gets stuck after 1 epoch

# main ref: https://www.kaggle.com/gunesevitan/nlp-with-disaster-tweets-eda-cleaning-and-bert#3.-Target-and-N-grams
# ref: google-research/bert#1133
@ahmedhassen7

I got the same error today and downgrading bert-tensorflow to 1.0.1 didn't fix it. Any other ideas?
tf version 1.15.2
python version 3.6.10
Anaconda environment on Ubuntu 18.04

Actually, you should restart the kernel after downgrading.

@kawthar-eltarr

I'm having the exact same issue. When I try downgrading to version 1.0.1 I get this error:

with tf.gfile.GFile(vocab_file, "r") as reader: AttributeError: module 'tensorflow' has no attribute 'gfile'

Does anyone know a solution for this, please?

@paramdutta

paramdutta commented Apr 25, 2022

Hello Friends,
I had this issue with tf 2.8 and bert 1.0.4 as well. I just stuck these lines before the offending call:
import sys
sys.argv=['preserve_unused_tokens=False'] #Or true, if you like
flags.FLAGS(sys.argv)

Cheers!

@AnandVamsi1993

Hello Friends, I had this issue with tf 2.8 and bert 1.0.4 as well. I just stick these lines before the offending call:

import sys
sys.argv=['preserve_unused_tokens=False'] #Or true, if you like
flags.FLAGS(sys.argv)

Cheers!

What is the 'flags' variable?

@ahmadharimukti

Trying to access flag --preserve_unused_tokens before flags were parsed.
I'm still stuck on this.

Downgrading didn't really make a difference for me.

@sa5r

sa5r commented May 25, 2022

I set the flag manually. Not sure this is right, but it made my code work.

import sys
from absl import flags
sys.argv=['preserve_unused_tokens=False']
flags.FLAGS(sys.argv)
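For anyone wondering why this works: absl treats the first element of the list passed to FLAGS() as the program name, so parsing a one-element list simply marks every flag as parsed with its default value; the string 'preserve_unused_tokens=False' itself sets nothing. A slightly more defensive version of the same idea (a sketch; assumes absl-py is installed, which bert-tensorflow already depends on):

```python
from absl import flags

FLAGS = flags.FLAGS

def ensure_flags_parsed():
    """Mark absl flags as parsed (with their default values) if nothing
    has parsed them yet, e.g. in a notebook that never calls app.run()."""
    if not FLAGS.is_parsed():
        # The first argv element is treated as the program name,
        # so no actual flag values are changed here.
        FLAGS(["notebook"])

ensure_flags_parsed()
```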

@honzikv

honzikv commented Jul 22, 2022

I set the flag manually. Not sure this is right but made my code work.

import sys
from absl import flags
sys.argv=['preserve_unused_tokens=False']
flags.FLAGS(sys.argv)

This fixes the issue

@Rask133

Rask133 commented Feb 13, 2023

I set the flag manually. Not sure this is right but made my code work.

import sys
from absl import flags
sys.argv=['preserve_unused_tokens=False']
flags.FLAGS(sys.argv)

Hey, thanks, it worked for me!

@xlsdust

xlsdust commented Apr 12, 2023

Hello Friends, I had this issue with tf 2.8 and bert 1.0.4 as well. I just stick these lines before the offending call:

import sys
sys.argv=['preserve_unused_tokens=False'] #Or true, if you like
flags.FLAGS(sys.argv)

Cheers!

Thank you, it fixes the issue!
